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Glossary  of  Frequently  Used  Terminology 

ALCR  -  Average  Level-Crossing  Rate 

APD  -  Amplitude  Probability  Distribution 

BFSK  -  Binary  Frequency-Shift  Keyed 

BPSK  -  Binary  Phase-Shift  Keyed 

CCV  -  Code  Connection  Vector 

Ch.f.  -  Characteristic  Function 

DPSK  -  Differential  Phase-Shift  Keyed 

E^/Nq  -  Bit  Energy  to  Noise  Spectral -Density 

E  /N  -  Symbol  Energy  to  Noise  Spectral  Density 
s  u 

GSNR  -  Generalized  Signal-to-Noise  Ratio 

ICS  -  Interactive  Communi cat ions  Simulator 

ISI  -  Intersymbol  Interference 

U<S  -  Least  Mean-Square 

ML  -  Maximum  Likelihood 

MSK  -  Minimum  Shift  Keyed 

p.d.f.  -  Probability  Density  Function 

-  Bit  Error  Probability 

P  -  Symbol  Error  Probability 
e 

QPSK  -  Quadrature  Phase-Shift  Keyed 

VA  -  Viterbi  Algorithm 

ZMNL  -  Zero-Memory  Nonlinearity 


I .  Introduction: 


In  synchronous  carrier-modulated  digital  communication  systems  the  main 
impediments  to  reliable  transmission  are  intersymbol  interference  (ISI),  add¬ 
itive  channel  noise,  and  errors  associated  with  the  ancillary  functions  of 
phase  tracking  and  symbol  synchronization.  For  single-element  systems,  re¬ 
ceiver  structures  have  been  devised  [ 1 ]  —  [ 3 3  which  deal  with  these  issues  in 
an  optimum  manner.  Specifically,  under  appropriate  modeling  assumptions,  the 
reception  problem  is  posed  as  one  of  maximum-likelihood  (11L)  sequence  esti¬ 
mation.  This  assures  that  the  receiver  is  optimum  in  the  sense  of  minimizing 
the  sequence-error  probability,  provided,  of 'course,  that  all  sequences  are 
equally  likely.  By  now  these  single-element  receiver  structures  have  been 
thoroughly  researched  and  are  quite  well  understood. 

In  an  increasing  number  of  important  applications,  however,  one  of  the 
major  impediments  to  reliable  communi cat ions  is  the  effect  of  spatially  dis¬ 
tributed  interference.  This  may  be  due  to  hostile  electronic  countermeasures 
(ECM)  or  Jamming,  or  unintentional  radio-frequency  interference  (RFl).  With 
the  addition  of  a  multi-element  array  capability  it  is  possible  to  provide 
spatial  filtering  in  an  effort  to  combat  this  interference  by  exploiting  its 
spatially  distributed  nature.  It  is  of  some  interest  then  to  explore  the 
structure  and  properties  of  ML  sequence  estimating  receivers  under  the  assump 
tion  of  spatially  distributed  interference  and  the  availability  of  an  adap¬ 
tive  multi-element  array  capability. 

The  present  report  describes  some  preliminary  work  directed  toward  the 
development  of  optimum  adaptive  multi-element  receiver  structures  for  digital 
communications  in  the  presence  of  spatially  distributed  interference. 


Specifically,  ve  describe  an  optimum  receiver  structure  under  specific 
modeling  assumptions  for  ML  estimation  of  a  linear  carrier-modulated  binary 
data  sequence.  This  optimum  structure  consists  of  an  interference  suppress¬ 
ion  spatial  filter  followed  by  a  matched  filter  sampled  at  the  baud  rate, 
and  a  recursive  nonlinear  processor  employing  the  Viterbi  algorithm  (VA)  to 
perform  ML  sequence  estimation.  The  result  is  a  practically  implementable 
optimum  spatial/temporal  processor  which  provides  a  significant  multi-element 
extension  of  previous  single-element  receiver  structures  described  in  [2], [3]. 

Some  preliminary  modeling  assumptions -are  provided  in  Section  II.  This 
is  followed  by  the  development  of  the  optimum  ML  receiver  structure  in  Section 
III.  The  VA  sequence  estimator  is  described  in  Section  XV.  An  adaptive  imple¬ 
mentation  of  the  ML  receiver  structure  is  described  in  Section  V.  Finally,  a 
summary  and  conclusions  can  be  found  in  Section  VI. 

II.  Preliminaries: 

In  what  follows  we  will  make  exclusive  use  of  complex  signal  and  noise 
representations.  Specifically,  the  received  signal  at  the  output  of  the  i’th 
array  element  is  assumed  of  the  form 

r^t)*/?  Re{ri(t)e’327Tfct}  ;  i=l,2,...,N,  (l) 

where  f  is  an  assumed  known  carrier  frequency  in  Hz  and  ?_.  (t)=r__.  (t)-Jr__.  (t) 

C  1  Cl  SI 

is  the  corresponding  complex  envelope.  The  quantities  rci(t)  211(1  rg^(t)  8X6 
lowpass  waveforms  representing  the  inphase  and  quadrature  (I/Q)  components, 
respectively.  We  assume  that  r^(t)  can  be  expressed  in  the  form 

?i(t)=Sio(t  »£.,T ,  4>)+Si  (t )  ;  i=l,2,...,N,  (2) 

where  n^(t)  is  the  total  additive  noise,  consisting  of  both  spatially  distri¬ 
buted  interference  and  thermal  element  noise,  at  the  output  of  the  i’th  element 


and  s^Q(t ;£,T ,4>)  is  the  corresponding  signal  component.  The  quantity^  £  is 
a  complex  vector  of  dimension  2m+1  representing  the  binary  data  sequence  to 
be  transmitted  in  a  manner  specific  to  the  modulation  strategy  in  use  vhile 
T  and  4>  represent  unknown  timing  delay  and  phase  offset,  respectively.  We 
suppose  that  the  complex  envelope  of  the  signal  component  can  in  turn  be 


expressed  as 


Si0(t;£,T,<J>)  =  |*hi0(t-o)S(o;c,T,*)dO  ;  i=l,2,. . . ,H. 


Here  I(t;£,T,<J>)  is  the  transmitted  signal  component  as  it  would  appear  at  the 
output  of  an  ideal  isotropic  sensor.  The  quantity  h^^(t)  in  (3)  represents 
the  complex  impulse  response  of  the  signal  path,  including  channel  filtering 
effects  and  propagation  across  the  array  aperture.  For  example,  if  no  channel 
filtering  effects  are  present  hiQ(t  )=6(t-Ti0)e^27rfcTiO  where  TiQ  is  the  re¬ 
lative  delay  in  receiving  the  signal  component  at  the  i'th  sensor  measured 
•■with  respect  to  some  fixed  phase  center  or  origin  of  spatial  coordinates. 

The  impulsive  component  accounts  for  the  group  delay  of  the  complex  envelope, 
while  the  complex  expoential  accounts  for  the  relative  phase  shift  of  the 

carrier.  The  quantities  tiQ,  i=l,2,...,N  can  generally  be  calculated  from 

knowledge  of  the  signal  direction  and  the  assumed  array  geometry.  For 

example,  consider  the  linear  array  illustrated  in  Fig.  1  with  signal  direction 

indicated  by  the  angle  0  measured  with  respect  to  boresight.  If  the  distance 

s 

in  wavelengths  of  the  i'th  element  from  the  origin  is  d^  then  TiQ=±d^sin0g, 
i=l,2,...,N  with  the  algebraic  sign  depending  upon  which  side  of  the  phase 
center  the  element  lies.  In  what  follows,  we  assume  that  the  signal  direction, 


In  what  follows  all  vectors  are  to  be  considered  column  vectors  so  that 
s.  =s(c_m’c-M+1’,",C0’**,,CM-1’CM)’ 


6g  in  Fig.  1,  is  known  a  priori  although  this  could  he  included  in  addition 

to  T  and  4>  as  a  parameter  to  he  estimated. 

In  general,  some  channel  filtering  is  present  so  that  the  signal  path 

impulse  response  function  can  he  represented  as  h.„(t)=h  (t-T* „)e'^27rfcTiO, 

lO  c  iO 

where  h^(t)  is  the  channel  impulse  response  common  to  all  elements.  For  the 

channels  of  interest,  the  bandwidth  B  will  always  he  such  that  B  T.rt«l  and 

c  c  iO 

hence  h^t-T^^h^t) ,  i=l,2,...,H.  Under  this  assumption  h^^(t)= 

hc(t)eJ2  cTiO,  i=l,2,...,N,  which  will  he  the  form  used  in  what  follows. 

The  transmitted  signals  of  interest  will  he  assumed  of  the  form"*" 

M 

S(t;£,T,<J>)=  l  c  g(t-mT  -x)eJ®  -(M+1/2)T  £t<(M+l/2)T  ,  (U) 

m=-M  s-  s 

where  T  is  the  channel  signaling  element  or  baud  interval  and  g(t)  is  a 
s 

complex  lowpass  pulse-like  waveform  vanishing  outside  the  interval 
-T  /2<t<T  /2.  It  follows,  according  to  preceding  comments,  that 

S  “  "  s 

M 

si0<-fc*,c_,T,4>)=  l  c  h(t-mT  -x)ej(*+<W  ;  i=l,2,...,B  ,  (5) 

m=-M 

where  ti0  3  an<i  ^(t)  =  represents  the  convolution  of 

the  baseband  pulse  waveform  and  the  channel  impulse  response. 

Consider  now  the  additive  noise  term  n^(t)  in  (2).  We  assume  this  can 
he  written  in  the  form 

ni(t)=yio(t)+Wi(t);  i=l,2,...,N  ,  (6) 

where  y^g(t)  represents  spatially  distributed  interference  at  the  output  of 
the  i’th  element  and  w^(t)  is  a  complex  zero-mean  white  Gaussian  noise  (WGN) 
process  assumed  independent  from  element-to-element.  Specifically, 

t  Note  that  we  are  restricting  attention  to  linear  carrier  modulation  formats. 
Certain  offset  quadrature  modulation  schemes  such  as  MSK  do  not  fit  into 
this  formulation. 
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V  (t)=v  (t)-jv  (t),  with  the  I/Q  components  w  .(t)  and  v  .(t)  mutually 

1  Cl  SI  Cl  SI 

independent  WGN  processes  with  double-sided  noise  spectral  density  NQ/2 
watts/Hz.  We  assume  that  y\(t)  can  "be  represented  in  the  form 

*i°(t)  =  5  1=1  ’ 2 . K  •  (7) 

Here  y^ (t )  represents  the  j’th  interference  component  originating  from  some 
unknown  direction.  For  example,  in  the  case  of  a  linear  array,  the  direction 
of  the  J'th  interferer  is  indicated  "by  the  off-boresight  angle  6  as  illust- 

J 

rated  in  Fig.  1.  The  quantity  represents  the  relative  delay  experienced 
by  the  j'th  interfering  signal  as  intercepted  at  the  i'th  element,  i=l,2,..., 
j*l,2,...,J.  Again,  in  the  case  of  a  linear  array  we  have  T.  =±d!sin9,. 

i  j  i  j 

Similarly,  the  phase  factors  4^  in  (7)  are  given  by  <f>ij=2rrfcTij . 

The  individual  interfering  signals  y^(t),  J=1,2,...,J  are  generally 

complex  broadband  processes.  We  will  make  the  simplifying  assumption  in 

what  follows  that  these  quantities  are  all  complex  mutually  uncorrelated 

zero-mean  Gaussian  processes  with  covariance  functions^ 

K'j  j  (t,»)  ■  E^(t)yj(s)}  ;  J=1,2,...,J  .  (8) 

J  J 

It  follows  that  the  total  noise  power  rL(t)  at  the  output  of  the  i'th  sensor 

is  Gaussian  with  cross-covariance  function 

K-  ~  ( t , s )  =  K~  ~  (t,s)+N  6(t-s)  ;  i=j 

i  J 


=  K~  ~  ( t ,  s ) 
yiyj 


;  i^j 


where 


This  information  will  be  made  use  of  in  what  follows. 


+  We  have  tried  to  indicate  complex  scalar  quantities  with  a  tilde 

Covariance  functions, system  transfer  functions  and  all  matrix  quantities 
will  be  implicitly  assumed  complex  so  that  the  tilde  will  be  eliminated. 


Ill .  Maximum-Likelihood  Receiver  Structure: 

The  element  output  signal  can  he  expressed  in  vector  form  as 

?(t)  =  |0(t;£,T,4))+n(t)  (ll) 

where  r_  (t  )=(r^(t)  ,fg(t) .  ,fjj(t) )  with  s^t  ;S,T , )  and  n(t)  obtained  similar¬ 
ly  from  the  corresponding  elements  of  ?(t).  Given  the  quantities  £,t,4>,  and, 
of  course,  the  signal  phase  factor  ^=(4>j  0,<J>2 0 , . . .  ,  the  signal  vector 

s^(t ;c_,T,(j>)  is  completely  known"*".  The  quantity  r(t)  is  then  a  complex  Gaussian 
process  with  mean-value  function  s^(t ;c_,T ,$)  and  covariance  matrix  K~~(t .s )= 

K-~(t,s)  where 
=nn 

gS5(t,s)=E{5(t)n+(s)}=g^(t,s)+R0  I  6(t-s)  .  (12) 

Here  I  is  the  NxH  identity  matrix  and  K~~(t  ,s)=E{£0(t )^( s) }  is  the  covariance 

«*T  _  — 

matrix  associated  with  the  vector  interference  process  y^(t )=(y1p(t) ,y2Q(t) , 


. . .  •  From  previous  comments 


Syy(t  >S  )  = 


KV‘<t,s> 


K-  -  (t,s)  . K~  -  (t,sf 

yiy2  yiyN 

Ks.s  (t,s)  .  K~  -  (t,s) 


K-  -  (t,s)  . K~  -  (t,s) 


,  (13) 


with  the  cross-covariance  terms  K-  -  (t,s)  given  by  (10). 

yiyJ 

As  a  relatively  straightforward  application  of  known  results  (cf.  [U], 
[5]),  it  can  be  shown  that  the  log-likelihood  functional  for  observing  r(t) 
given  fixed  values  of  the  parameters  5,  t,  and  $  is  given  by 


t  The  signal  path  overall  impulse  response  function  h(t)  is  also  assumed 
known. 

ft  Here,  and  in  what  follows,  the  symbol  "t"  will  represent  complex  con¬ 
jugate  transpose. 
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£{r(t)  ;c.,t,4>}=  Rej  j  j  r  (t)gg^(t,u)i<)(u;£,T,<{))dtdu| 


where  g^~(t  ,u)  is  an  HxN  Hermit i an  matrix,  called  the  inverse  covariance  matrix 


satisfying 


j™  Kg£(t ,u)S~~(u,s)du  =  I  6(t-s)  . 


Substitution  of  (5)  into  the  argument  of  the  first  term  on  the  right-hand  side 


of  (lit)  yields 


J"  j*  f+(t)g£~(t,u)5<;)(u;c_,T,4>)dtdu 


“  L£.  , 


where  r  is  now  a  vector  of  length  2M+1,  with  components  defined  according  to 

?m  *  j*  j°°  rT(t)Q*~(t,u)h*(u-mTs-T)e~^|^dtdu  ;  m=0,±l, . . .  ,±M.  (IT) 

—00  —00 

Here^  £q  is  a  complex  N-vector  defined  according  to  j^=(e-,^10,e  0 , . . .  .e'^N 

After  some  simple  algebra  it  can  be  shown  that  ?m  can  be  expressed  in  the  form 

rffi  *  j*  rT(t)e'J<,,G*(t-(mTs+T))|0dt 


=Sm 


:  . . a  , 


where 


L  =  [5‘(-t)®{r(t)e"'3<,>}) 


t=mT  +T 
s 


;m=0,±l,±2, . . . ,±M  ,  (19) 


t  As  we  will  see  is  merely  a  beam-steering  vector  which  allows  pointing 
a  beam  in  the  known  signal  direction. 


with 


a 


r 


G(t)  =  g^~(t-u)h(u)du  ;  -cc<t<“, 


(20) 


and  we  have  implicitly  assumed  here,  and  in  what  follows,  that  the  interfer¬ 
ence  process  y_(t)  is  wide-sense  stationary  (w.s.s.)  so  that  K — (t ,u)=K— (t-u) 

'J  =nn  *1 in 

and  similarly,  at  least  for  long  observation  intervals,  that  S~~(t ,u)=S^~(t~u) . 
Equivalently,  it  can  be  shown  that  G(t)  satisfies  the  matrix  integral  equation 


r 


K~~(t,s)G(s)ds=I  h(t)  ; 

=xui  —  = 


(21) 


* 


The  quantity  m=0,  1,  2, —  ,  M  in  (19)  has  an  interpretation  as  the  sampled 

output  of  a  multi-channel  filter  with  impulse  response  matrix  G^(-t)  and 
-/  >  —Id) 

input  r(t)e  °  .  Sampling  is  assumed  to  take  place  at  the  times  t=nT  +T, 

" "  s 

mss0,±l,±2, . . .  ,±M.  The  vector  x^  is  then  weighted  by  the  beam-steering  vector 
to  form  rffl  according  to  (l8).  This  weighting  effectively  orients  the 
array  of  elements  to  point  in  the  known  signal  direction.  A  block  diagram 
of  a  scheme  for  generating  the  sequence  {?m)  is  then  as  illustrated  in  Fig.  2. 
Note  that  the  beam  steering  is  accomplished  at  the  output  of  the  multi-channel 
filter  and  not  the  input  as  in  most  conventional  adaptive  array  systems.  It 
should  also  be  noted  that  in  the  absence  of  spatially  distributed  interference, 
or  indeed  merely  the  absence  of  correlation  across  the  array  elements,  (2l) 
is  trivially  solved  with  G(t)  proportional  to  a  diagonal  matrix  with  h(t)  along 
the  main  diagonal. 

In  a  similar  manner,  it  can  be  shown  that  the  integral  appearing  in  the 


second  term  in  (lU)  can  be  expressed  as 

r 00  (' 


§^(t;a,T,$)gftft(t,u)|<3(u;5,T,<^)dtdu=2+H  5_ 


(22) 


_QD  _0O 


where  H  is  an  Hermitian  matrix  of  size  (2M+l)x(2M+l)  independent  of  the 
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observations  with  (m,n)  element  h  given  by 

m-n 

hm-n=^0  {  j  h#(t-mTs-T)gflfl(t-u)h(u-nTs-T)dtdujc0;  m,n=0,±l, 

-<*>-<*>  ...  ,+M.  (23) 

Again  we  assume  that  all  quantities  in  (23)  are  known  so  that  H  in  (22)  is 

readily  evaluated. 

It  follows  from  the  preceding  that  the  log-likelihood  functional  in  (ll) 
can  then  be  expressed  in  the  form 

*{?(t);c,T,$}=Re{r+e}-(l/2)c.+g  2 

=  (l/2){r 1  c+c^r-c/ H  c_}  (2U) 

=  ( 1/2 ) { r+g-1r- ( c_-g-1r ) +g ( c_-g-1r ) } 

where  the  last  equality  is  easily  verified  through  simple  vector-matrix 
algebra.  The  objective  then  is  to  choose  estimates  c_,t  and  <j>  to  maximize 
this  quantity.  It  should  be  clear  from  (2k)  that  the  vector  £,  representing 
the  sequence  of  outputs  from  the  processor  illustrated  in  Fig.  2,  provides 
a  sufficient  statistic  for  extracting  all  the  parameters. 

The  system  illustrated  in  Fig.  2  for  generating  the  sequence  of  observ¬ 
ables  {?m}  provides  combined  spatial/temporal  processing  in  the  single  multi¬ 
channel  filter  with  impulse  response  matrix  G^(-t).  It  may  be  desirable  in 
some  cases  to  separate  the  spatial  and  temporal  processing  to  the  extent 
possible.  Indeed,  this  is  generally  the  case  in  conventional  array  process¬ 
ing  systems.  This  can  be  accomplished  by  observing  that  ?  in  (17)  can 

m 

alternately  be  expressed  in  the  form 

?  ■  [h»(-t)  ®{x'(t)e-J*}]  ;m=0,±l , . . .  ,±M,  (25) 

t=mT  +t 
s 

X’(t)  =  Sj[gftft(t)0?(t)]  ; 


where  now 


_00<t<<». 


(26) 


The  term  within  brackets  in  (26)  represents  again  a  combined  spatial/temporal 
multi-channel  filtering  operation  although  the  common  "matched  filtering" 
operation,  represented  by  the  convolution  with  h*(-t),  is  now  performed 
separately  at  the  array  output.  This  is  illustrated  in  block  diagram  form 
in  Fig.  3  where  the  spatial/temporal  processor  in  the  array  forming  network 
now  has  impulse  response  matrix  g~~(t). 

In  either  case,  the  multi-element  ML  processor  then  chooses  the  estimated 
sequence  5  to  maximize  the  log-likelihood  functional  in  (2*0  while. the  timing 

<A  A 

and  phase  estimates,  T  and  6  respectively,  solve  the  coupled  equations 

3£{r(t)  ;£,T,<t>) 

- * - =  0  (27a) 


3£{r(t)  ;c,t,4>) 


(27b) 


Clearly  then  c_  is  chosen  as  the  "legitimate"  sequence  minimizing  the  perform¬ 


ance  measure 


J(e)  =  (c-r1£)+g(i-H“1r). 


C  -1- 


Note  that  the  solution  is  not  merely  cfg  £  since  this  is  in  general  analog¬ 
valued  and  need  not  be  a  legitimate  data  sequence.  As  shown  in  [2],  the 
operation  H-"*"  represents  an  equalizer,  in  this  case  attempting  both  to  equal¬ 
ize  the  effects  of  ISI  and  spatially  distributed  interference.  The  convention- 

A 

al  approach  to  obtaining  is  to  pass  the  observation  sequence,  represented 
in  this  case  by  r.,  through  an  equalizer  and  then  quantize  the  output  on  a 
symbol-by-symbol  basis  to  the  closest  valid  data  symbol  value.  In  the  next 

A 

section  we  describe  how  the  VA  can  be  used  to  choose  the  ML  estimate  £.  First 

Ai  A 

we  show  how  the  estimates  T  and  $  can  be  chosen  in  a  decision-directed  manner 
to  satisfy  the  ML  equation  (27). 
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Suppose  that  a  sequence  of  observations  each  of  total  length  (2M+l)T 

s 

are  made  by  sliding  the  observation  vindov  by  K  baud  at  each  iteration.  Let 

r^t)  represent  the  N-vector  of  observations  obtained  at  the  i’th  iteration, 

i.e.,  for  t  in  the  range  (iK-M-l/2)T  <t<(iK+M+l/2)T  .  Note  that  if  K=2M+1 

s -  s 

there  will  be  no  overlap  in  the  observation  windows  while  if  K=1  the  window 
is  merely  slid  by  one  baud  each  iteration.  At  any  rate,  according  to  (21*) 
the  log-likelihood  functional  is  given  by 

A{?i(t)  ;ci,T,(|)}=  Re{c^r^}  -(l/2)ct  g  c^  ,  (29) 


where  now  (riK_M»riK_M+1 » •  •  •  »riKf  •  •  •  »riK-+M 


,...,riK/..,riIC+M),  and  similarly,  £^(5.^, 


5iK-M+l’’ ‘ ' ’aiK** ' * ’5iK+M^'  Zt  follows  that 


t)8£i.T.*}  _  ,.+*  , 

- ^ > 


*l{h{ ti^.T,#)  „  ^gt-  }  > 

3<t> 


(30a) 


(30b) 


where  according  to  either  (18)  or  (25)  we  have  the  m’th  element  of  ?.  given  by 


i,m  dt  -*0  -  'O'  t*(iK+m)T  +T  ;m  0’±1»  ‘  ‘ '  >±M 


r.  *  frUM-t)®  {x'(t)e~J*}]  ;  m=0,±l, . . .  ,±M  ,  (31b) 

1  t=(iK+m)T  +t 


respectively.  In  either  case,  the  estimates  can  be  generated  recursively 


according  to  the  stochastic  gradient  schemes 


4>i+1  *  ♦i  +  c^Imtc^}  ;  i=0,l, 


(32a) 


t.  *  x .  +  B.,Re{c7r,};  i*0.1 . 


(32b) 


t  Again  x!(t)  represents  observations  on  x'(t)  over  the  window  (iK-M-l/2)T 
t<{ iK+M+l/2 )T  . 


Here  8^,  axe  arbitrary  and  the  positive  real  step  sises  {a^}  and  {B^,}  mist 

he  specified.  Note  that  the  recursive  estimators  described  by  (32)  are  de- 

* 

cision-directed  in  the  sense  that  estimated  sequences  {H_^}  are  actually  used 

in  place  of  the  actual  but  unknown  sequences  {c^}  .  The  resulting  decision- 

directed  structure  is  illustrated  in  Fig.  U. 

As  a  special  case,  suppose  we  use  non-overlapping  blocks  (i.e.,  K=2M+l) 

of  matched  filter  outputs  with 

00  00 

*,  X  a?  (33) 

i=0  1  i=0 

and  similarly  for  {B^}.  This  is  the  so-called  Robbins-Monro ■»  stochastic  approx¬ 
imation  algorithm.  Here  it  can  be  shown  that  convergence  to  the  true  ML  esti¬ 
mates  is  obtained  both  in  mean-square  and  with  probability  one.  In  practice 
it  may  be  desirable  to  update  the  timing  and  phase  errors  more  frequently,  say 
at  the  baud  rate  by  taking  K*l.  Convergence  is  not  assured  in  this  case. 


TV.  Vlterbi  Algorithm  Sequence  Estimator: 

A 

We  describe  new  how  the  VA  is  to  be  used  to  obtain  the  ML  estimate  jc 
of  a  transmitted  data  sequence  c.  For  the  moment  assume  that  the  m'th 
sequence*  c^  of  length  2M+1  is  transmitted.  According  to  (2U)  the  log- 
likelihood  functional  can  be  written  as 

A{r(t);5  )  *  Re(c*r)-(l/2)c '  g  5 

’  I„  I  Im  •  <*> 

where  for  simplicity  ve  have  neglected  the  symbol  timing  and  phase  tracking 
errors  T  and  $,  respectively.  The  quadratic  term  can  then  be  written  as 


+  This  notation  differs  somewhat  from  that  used  in  association  with  (29) 
and  the  sequel.  Hopefully  the  meaning  in  either  case  is  clear  from  the 

context . 
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iLm  jLm  |cni*  h0+2Be  I  J_M 

M  -  M  M+i  ~ 

-  X  |e  . |2h.+2Re  -  T  c«  7  h.  c  ,  . 
i-M  0  i=-M  k=l  ^  ffi*i-k 


We  assume  that  the  filter  memory  is  such  that  h^cO  for  k>K,  where  K«M.  It 


follows  that 


M  r  r  min(K-l,M+i)  ^ 

[se{5^i)-(l/2)|5mll!V!i'{5^  ^  •  (36> 


or  equivelently. 


i{r(t);5  }  =  l  X\ 
^  i*=-M  1 


where  X.a  is  the  "branch  metric"  associated  with  the  i'th  branch  along  the 


m'th  path  given  by 


f  \  &  /  min(K-l ,M+Z)  \  fno\ 

41  J,  V„.i-k}  •  <38> 

Vote  that  evaluation  of  the  branch  metric  xj^  requires  knowledge  of  at  most 

K-l  past  data  symbols,  i.e.,  c  .  .  ,k=0,l, . . .  ,min(K-l,M+£) .  Indeed,  the 

in  j  Af*»ic 

summation  in  the  last  term  of  (38)  represents  a  finite-impulse  response  (FIR) 
filtering  operation  which  can  be  implemented  as  a  transversal  filter.  A 
scheme  for  evaluating  this  branch  metric  is  illustrated  in  Fig.  5* 

The  totality  of  possible  paths  or  data  sequences  can  then  be  described 
in  terms  of  a  trellis  structure  defined  in  terms  of  the  K-l  most  recent  symbols. 
Define  the  cumulative  path  metric  up  to  level  k  in  the  trellis  according  to 

VMJ  -  XM  x*°  •  (39) 

with  d  w  ,(r,c  )*0  for  all  m.  The  VA  then  searches  the  trellis  attempting 

— M— A  -—  — m 

to  find  that  path  which  maximizes  the  cumulative  path  metric  through  level  M 


of  the  trellis.  A  block  diagram  illustrating  the  operation  of  the  ML 
sequence  estimator  employing  the  VA  is  then  illustrated  in  Fig.  6.  Act¬ 
ually,  there  is  no  need  to  impose  a  block  structure  on  the  VA  sequence 
estimator.  Specifically,  the  scheme  can  be  made  synchronous  with  the  baud 
rate  by  outputing  a  delayed  symbol  estimate  c^  ^  during  the  i’th^baud 
interval  with  D»K  an  appropriate  delay.  The  estimate  cL  ^  is  selected  as 
the  symbol  in  the  corresponding  position  along  the  surviving  path  with  the 
largest  cumulative  path  metric  at  level  i.  This  is  a  relatively  straight¬ 
forward  application  of  the  VA  as  used  in  practice. 


V.  Adaptive  Implementation: 

In  the  preceding  we  have  assumed  explicit  knowledge  of  the  complex 
channel  response  Ec(t),  or  equivalently  S(t),  as  well  as  the  covariance 
matrix  £^(t,s)  of  the  interference  component  as  described  by  (13).  In 
particular,  this  allowed  evaluation  of  the  spatial/temporal  matched  filters 
in  either  Fig.  2  or  Fig.  3,  as  well  as  the  sampled  channel  correlation  matrix 
H  defined  through  (23).  Recall  that  this  latter  quantity  was  required  for 
calculation  of  the  branch  metrics  in  the  VA  as  illustrated  in  Fig.  5.  Un¬ 
fortunately,-  information  on  S(t)  and  K— (t,s)  is  not  generally  available  and 

~yy 

one  must  then  seek  an  adaptive  implementation  of  the  ML  receiver  structure. 

One  adaptive  implementation,  suggested  by  the  approach  in  [3],  is  to 
approximate  the  unknown  spatial/temporal  filtering  operations  by  a  class  of 
finite-parameter  filters  whose  paremeter  values  can  be  adaptively  adjusted 
to  approximate  the  desired  response.  There  are  two  different  approaches  here 
depending  upon  whether  the  implementation  in  Fig.  2  or  that  in  Fig.  3  is 
employed.  In  the  latter  there  must  be  two  adaptive  loops ;  one  to  adjust 
elements  of  the  temporal  matched  filter  h*(-t),  and  a  second  to  adjust 


elements  of  the  multi-channel  spatial /temporal  filter  represented  hy  Q— (t). 

-nn 

The  first,  strictly  temporal  loop,  can  he  implemented  exactly  as  in  [3], 
while  the  second  spatial/temporal  processor  can  be  implemented  using  con¬ 
ventional  adaptive  array  techniques  [6],  [?].  Specifically,  the  matrix  filter 
§25 (t)  in  Fig.  3  is  replaced  by  a  bank  of  transversal  filters  with  tap  spacing 
T  no  more  than  1/2B0  where  BQ  is  the  signal  bandwidth.  The  array  output 
signal  x' (t)  is  filtered  in  a  post-array  lovpass  filter  and  presented  as 
input  to  an  adaptive  algorithm,  such  as  the  least  mean-square  (LMS)  algorithm. 

A  reference  input  3(t)  can  be  obtained  in  a  decision-directed  fashion  as  the 
quantity^ 

A  AAA  M  A  A  A  *  1 

s(t;c,t,4>)  =  l  c  E(t-mT  -T)eJ?  ;  -(M+l/2)T  <t<(M+l/2)T  ,  ( ho) 

m=~M  s  5 -  s 

£  A  A 

where  {c^}  represents  ML  estimates  from  the  output  of  the  VA  and  T,<|>  are 

A 

corresponding  timing  and  phase  estimates.  The  quantity  E(t)  represents  an 
estimate  of  the  modulator/channel  impulse  response  E(t)  as  obtained  in  the 
first  adaptive  loop.  Note  that  the  VA  must  make  use  of  the  esti¬ 
mated  response  ggg(t)  in  computing  the  quantity  ^according  to  (23)  which 
is  required  in  branch  metric  computations.  Existing  adaptive  array  schemes 
generally  assume  that  S~~(t)  can  be  approximated  by  a  parallel  bank  of  trans¬ 
versal  filters  without  any  interacting  terms.  This  is  equivalent  to  assuming 
that  g~~(t )  is  diagonal^.  It  is  not  clear  how  this  assumption  degrades  poten¬ 
tial  performance  although  it  is  a  reasonable  assumption  for  wideband  inter¬ 
ference.  We  will  not  discuss  this  approach  further  since  it  is  a  relatively 
straightforward  application  of  existing  techniques. 

t  This  assumes  that  appropriate  delays  in  the  VA  sequence  estimator  are 
accounted  for. 

ft  Note  that  in  this  case  the  beam  steering  network  in  Fig.  3  can  be  inter¬ 
changed  with  the  spatial/temporal  filter,  as  is  the  case  in  most  existing 
adaptive  array  systems. 
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The  adaptive  implementation  of  the  scheme  in  Fig.  2  is  somewhat  more 


interesting  and  apparently  has  not  been  described  previously.  Here  we 
assume  that  the  filtering  operation  represented  by  G‘(-t)  can  be  approxi¬ 
mated  by  a  network  of  transversal  filters,  each  of  length  L  with  tap  spacing 


T  no  more  than  1/2Bq  seconds.  Specifically,  the  system  transfer  matrix  of 


this  approximating  network  is  diagonal  of  the  formT 

Htawii,.)  o 


E(jw)  = 


(i*D 


where  jL  ,  i=l,2,...,N  is  an  L-vector  representing  the  weights  of  the  corres¬ 
ponding  transversal  filter  such  that 

-j  (L-l)ioT 


I  g,  »e 

&=1 


-j(L-l)wT 


As- .noted  previously,  the  beam  steering  network  in  Fig.  2  can  then  precede 


the  spatial/temporal  processor.  Let  ?^(t),  i=l,2 . N  represent  the  corres 


ponding  input  to  this  processor  and  define  the  L-vectors 


r'(t) 


r^(t-T) 


ur’(t-(L-l)T) 


;  i=l,2, . . . ,N 


(1*3) 


Finally,  define  the  vector  ?'(t)  of  dimension  NL  as 


t  It  can  be  shown  that  no  loss  of  generality  results  from  the  assumption 
of  diagonal  form. 


Then  the  output  signal  r(t)  in  Fig.  2  is  given  by 

r(t)  *  £T  ?' (t) 

where 


Similarly,  the  temporal  filter  n(t)  is  approximated  by  a  transversal  filter 
of  length  2L'+1  with  corresponding  tap  vector  £  s(q_L, ,q_L+1» . . . ,qQ,. . . ,5^, 
).  The  received  signal  estimate,  corresponding  to  (1*0),  is  then 

A  AAA  A  A 

s(t;c.,T,<|>)  *  l  c  fqf  ,  t=mT  +T  .  (1*7) 

ra-x  *■  s 

The  error  signal  e(t)=r(t )-cl(t )  is  then  formed  with  3(t)  set  equal  to  the 
decision-directed  signal  estimate  of  (1*7).  At  the  sampling  instants  then 


we  have 


e  =  e(mT+T) 

ZD. 

s  &z  -  a.  s 


and  it  is  possible  to  choose  both  £  and  £  to  minimize  the  quadratic  performance 


criterion 


=  E{|5al!>  • 


At  this  point  standard  stochastic  gradient  procedures,  such  as  the  LMS 
algorithm, can  he  applied  to  minimize  (^9)  with  respect  to  both  £  and  <i. 
The  details  remain  to  he  worked  out. 


VI.  Summary  and  Conclusions: 


We  have  described  a  novel  adaptive  multi-element  receiver  structure  for 


ML  sequence  estimation  in  a  spatially  distributed  interference  channel.  This 
structure  provides  adaptive  element  weighting  and  simultaneous  data  demodula¬ 
tion,  phase  tracking  and  bit  synchronization.  The  structure  differs  from  the 
conventional  approach  where  these  functions,  are  all  performed  separately  and 
independently.  A  present  weakness  of  the  result  is  the  assumption  of  Gaussian 
interference.  Hopefully  this  can  be  removed  in  future  work.  Nevertheless, 
the  present  structure  is  interesting  for  the  light  it  sheds  on  how  an  optimum 
processor  is  implemented.  In  particular,  the  adaptive  array  weighting  should 
be  combined  with  the  modulation  and/or  coding  into  an  integrated  spatial/ 
temporal  processor. 
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Operation  of  the  ML  Sequence  Estimator  Employing  the  VA 


PART  II 

REDUCED-SEARCH  SOFT-DECISION  TRELLIS 
DECODING  OF  LINEAR  BLOCK  CODES 


I .  Introduction : 


Several  authors  have  devised  decoding  schemes  for  linear  block  codes  which 
utilize  the  channel  measurement  information  contained  in  a  real-valued  decision 
statistic  available  at  the  output  of  a  receiver  [ 1 ]— [ 6 ] .  However,  most  of 
these  algorithms  are  either  unreasonably  complex  or  are  applicable  only  to 
restricted  classes  of  codes. 

Wolf  [6]  has  proposed  an  algorithm  which  has  an  interesting  representa¬ 
tion  as  a  search  for  the  maximum  likelihood  path  through  a  trellis  which  can 
be  defined  for  a  block  code.  This  scheme  applies  the  Viterbi  algorithm  (VA) 
to  perform  the  search  and  has  been  shown  to  perform  maximum-likelihood  decod¬ 
ing.  Unfortunately,  the  computational  complexity  of  the  VA  increases  with 
the  width  of  the  trellis  to  be  searched.  In  its  application  to  the  decoding 
of  convolutional  codes,  the  VA  becomes  computationally  unfeasable  for  codes 
of  constraint  length  greater  than  10  or  12.  There  is  a  similar  limitation 
when  the  VA  is  used  for  maximum  likelihood  decoding  of  linear  (n,k)  block  codes 
over  GF(a).  The  width  of  the  trellis  in  this  case  can  be  as  high  as 
q^n  and  the  VA  is  too  complex  for  many  codes  of  even  moderate  length. 

This  paper  is  concerned  with  the  development  of  a  reduced-search  algorithm 
which  is,  in  a  sense,  a  generalization  of  the  VA.  The  complexity  of  this 
algorithm  is  a  function  of  several  parameters  and  we  demonstrate  that  the 
algorithm  gives  performance  close  to  the  VA  when  applied  to  the  block  code 
trellis  defined  by  Wolf  at  the  seme  time  the  complexity  is  significantly  reduced. 
Related  algorithms  have  been  applied  in  maximum-likelihood  sequence  estimation 
on  intersymbol  interference  (ISI)  channels  [10],  [ 11 ] . 


Trellis  Decoding  of  Linear  31ock  Codes: 


When  applied  to  the  maximum-likelihood  decoding  of  an  (n,k)  linear  block 


code,  the  VA  is  used  to  search  a  trellis  which  is  defined  in  terms  of  the 


parity  check  equations  satisfied  by  the  2  code  words  of  the  code.  The 
decoder  attempts  to  track  the  "state"  of  the  partial  parity  check  sum: 

k 

S(k)  =  l  z  h . ,  k=0 ,1 , . . .  ,n  . 

i=l 

Here,  h^  represents  the  i'th  column  of  the  (n-k)  x  n  parity  check  matrix 
H  and  z^  is  the  i'th  entry  in  a  binary  row  vector  z_.  To  each  sequence 
z^ ,  i=l,2,...,n  there  corresponds  a  sequence  of  parity  check  sums 
S^(k) ,  k=l,2, . . .  ,n. 

We  can  use  a  trellis  diagram  to  represent  the  evolution  in  time  of 
this  sequence  of  parity  check  sums.  The  partial  parity  check  sum  can  be 
considered  a  state  so  that  the  trellis  diagrams  the  transitions  between 
these  states  from  one  node  level  to  the  next. 

A  trellis  for  a  (5,3)  code  with  parity  check  matrix 

110  1  0  ' 

10  1  0  1  J  , 

is  shown  in  Fig.  1.  Each  binary  n-tuple  traces  a  distinct  path  through 
the  trellis  corresponding  to  a  sequence  of  transitions  of  the  parity  check 
states. 

We  know  that  for  each  of  the  2  binary  sequences  of  length  n  which  are 

in  the  code  the  corresponding  sequence  of  parity  check  states  must  end  with 

the  all  zeros  state,  i.e.,  S(n)=£  since 

T  r 

xH  =  l  x.  h.  =0  , 

if  x.  is  a  codeword. 

Clearly,  then,  a  path  through  the  trellis  corresponding  to  a  codeword 
must  terminate  at  the  all  zero  node  at  the  final  level. 
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The  VA  operates  "by  finding  the  path  through  the  code  trellis,  correspond¬ 


ing  to  a  codeword,  which  maximizes  the  log-likelihood  functioned 

Lm{r}  =  log  f  ^(rl^)  , 

— '-m 

where  r_  represents  the  real -valued  receiver  output  sequence  and  f  i  -  ( • [x  ) 

— '-m  _I31 

describes  the  channel  transition  probability  mechanism.  If  we  assume  that 
the  channel  affects  each  transmitted  symbol  independently,  the  log-likelihood 
functional  becomes 

Lm{£}  =  l  log  f(rii3cmi)  • 

i=l 


In  the  particular  case  of  matched  filter  reception  of  binary  antipodal 
signals  transmitted  over  an  additive  white  Gaussian  noise  (AWGN)  channel,  the 
receiver  output  sequence  is  of  the  form: 


r.»  - 
1 

*/2E  /Id 
s  0 

+ 

n. 

i 

;  x.=o 

[• 

+ 

n. 

i 

;  vi  • 

where  E  is  the  signal  energy.  Id/ 2 
s 

is 

the 

double-sided  noise  spectral  density  in 

watts/Hz,  and  n^  is  a  zero-mean  Gaussian  variate  possessing  unit  variance. 

III.  A  Trellis  Searching  Algorithm  ; 

We  consider  a  section  of  a  directed  trellis  of  the  general  form  of  Fig.  2. 

The  full  trellis  may  be  very  vide.  In  general,  however,  we  will  consider  searching 
a  selected  sub-trellis  of  the  full  trellis  which  has  a  much  smaller  width. 


We  associate  a  distinct  state  S^(k)  with  each  node  at  any  given  node  level 

k.  Nodes  at  a  certain  node  level  are  connected  to  nodes  at  the  next  level  by 

directed  lines  called  branches.  We  denote  the  set  of  branches  emanating  from  a 

particular  node  S^(k)  by  I\(k)  and  the  size  of  this  set  by  L^(k).  We  sometimes 

wich  to  consider  a  distinguished  subset  of  I\(k).  A  subset  of  I\(k)  will  be 

denoted  as  H(k)  and  the  size  will  be  denoted  by  L|(k).  Finally,  each  branch  in 

the  trellis  has  associated  with  it  a  branch  symbol  x.  (k)  and  a  branch  metric 

1  5  J 

d.  (k).  The  quantities  x,d  and  S  are  illustrated  in  Fig.  2. 

1  j  J 

For  each  node  at  any  given  node  level  k'  it  is  possible  to  describe  a  path 
incident  upon  the  node  by  a  sequence  of  branch  symbols 

x_  —  j  i  •  •  •  5  ^  * 

where  x  corresponds  to  the  branch  travelled  from  a  node  at  the  j’th  node  level 
J 

to  its  successor  at  the  (j+l)'st  level.  We  also  associate  with  this  path  a  cum¬ 
ulative  path  metric  value  d  which  is  the  sum  of  the  branch  metric  values  for 
the  branches  in  the  incident  path. 

At  a  given  node  level  k,  we  assume  that  we  have  stored  a  list  of  nodes. 

The  list  also  contains  a  branch  sequence  and  cumulative  path  metric  corresponding 
to  a  selected  incident  path  for  each  of  the  nodes  on  the  list.  This  list  for 
a  given  node  level  k  is  shown  at  the  left  at  the  bottom  of  Fig.  2. 

We  now  describe  a  pr*  ..edure  for  stepping  through  the  trellis  to  the  next 


node  level,  thereby  extending  the  paths  and  creating  a  new  list  at  level  k+1. 

We  do  this  by  attempting  to  extend  each  of  the  paths  on  the  back  list  (labeled 
B  in  Fig.  2)  to  a  new  node  which  will  then  be  placed  on  a  forward  list  (labeled 
F  in  the  figure). 


For  each  of  the  nodes  on  the  back  list  at  level  k,  the  following  steps 
are  performed: 

1.  Choose  a  subset  IV(k)  of  the  set  I\(k)  of  branches 
emanating  from  the  node  to  be  extended  to  nodes  at 

the  next  node  level.  This  subset  contains  L!(k)  branches. 

1 

2.  Extend  the  path  by  traveling  along  one  of  the  L!(k) 
branches  in  the  subset  IV  (k)  emanating  from  this  node 
to  a  new  node  at  level  k+1.  Update  the  cumulative 
path  metric  by  including  the  contribution  from  the 
branch  metric  associated  with  the  branch  just  extended. 

3.  Search  the  forward  list  F  to  determine  if  the  new  node  is 
already  present.  If  the  node  is  not  present,  go  directly 
to  setp  5*  If  the  node  is  present  continue  to  step  U. 

U.  Compare  the  cumulative  path  metric,  just  calculated  with 
the  corresponding  metric  already  on  the  forward  list. 

If  the  new  metric  is  larger,  go  to  step  5-  If  not,  go 
to  step  6. 

5.  Extend  the  path  sequence  from  the  present  node  by  adding  the 
branch  symbol  associated  with  the  current  branch  to  the  path 
sequence  associated  with  the  node  under  consideration.  Re¬ 
place  the  cumulative  path  metric  and  path  symbol  from  the 
node  on  the  forward  list  with  those  just  produced  from  the 
transition  from  the  node  on  the  back  list.  This  effectively 
connects  the  node  on  the  forward  list  to  the  node  under  con¬ 
sideration  on  the  backward  list. 

6.  If  all  the  IV  (k)  branches  emanating  from  the  node  on  the  back 
list  have  not  been  exhausted,  go  to  step  2. 
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:ed  the  extension 


The  forvard  list  will  be  complete  after  we  have  perform 
procedure  for  each  of  the  nodes  on  the  back  list.  At  this  time  we  purge  all 
but  the  K  nodes  with  the  highest  path  value.  We  can  also  reorder  the  list 
any  way  we  desire  (e.g.,  in  order  of  decreasing  metric  value). 

Once  the  above  steps  have  been  performed  at  any  given  node  level  the 
algorithm  is  complete  and  we  have  proceeded  one  level  further  in  the  trellis. 

The  forvard  list  becomes  the  back  list  and  the  process  is  repeated  until  ve 
have  come  to  the  end  of  the  trellis. 

This  algorithm  has  some  similarity  to  the (K,L) algorithm  described  by 
Anderson  and  Mohan  [  7  ] .  It  is  one  of  a  class  of  tree  searching  schemes  called 
breadth  first  algorithms.  If  M  =  q  (ve  do  not  purge  any  nodes  at  any  node 
level)  and  we  extend  each  of  the  branches  emanating  from  the  nodes,  this  algorithm 
is  equivalent  to  the  Viterbi  algorithm  and  will  perform  true  maximum-likelihood 
decoding.  We  now  demonstrate  the  application  of  the  algorithm  to  soft-decision 
decoding  of  linear  block  codes  and  show  how  the  channel  reliability  information 
can  be  used  to  determine  an  appropriate  subset  of  the  branches  to  be  extended 
from  each  node. 

IV.  Application  to  Soft-Decision  Decoding  of  Binary  31ock  Codes; 

The  algorithm  described  above  can  be  used  to  search  the  parity  check  trellis 
defined  previously.  In  this  case  the  branch  symbols  are  simply  the  transmitted 
symbols  with  x.  ,(k)=0,l.  The  branch  metrics  for  binary  antipodal  signaling 
in  AWGN  then  become 

x.  ,(k) 

ai,j(k>  ■  V-l!  - 

with  r^  representing  the  k'th  component  in  the  received  real -valued  n-vector 
r=(r^,r2, —  ,rn).  We  also  associate  a  real-valued  vector  ...  san)  called 

the  reliability  vector.  These  positive  numbers  give  a  measure  of  the 


reliability  of  the  received  symbols  if  the  receiver  were  forced  to  make  a 
hard  decision.  Thus  of  is  large,  it  is  more  probable  that  a  decision 
made  by  the  receiver  upon  the  transmitted  symbol  is  correct.  For  Gaussian 
channels,  the  magnitude  of  the  real-valued  matched  filter  output  provides  an 
effective  measure  of  the  reliability  of  the  corresponding  hard  decision  esti¬ 
mate.  We  can  use  this  channel  reliability  information  to  aid  in  selecting  a 
subset  of  the  parity  check  trellis  to  be  searched.  At  the  first  k  node  levels 

for  an  (n,k)  linear  code  each  node  on  the  trellis  will  have  2  branches  emanat- 

n  — k  k 

ing  from  it.  The  width  of  the  trellis  is  bounded  by  min{2  ,2  }. 

12 

As  an  example,  the  trellis  for  the  Golay  (2U,12)  code  contains  2  =1*096 

nodes  or  states  at  its  widest  point.  There  is  no  advantage  in  using  the  VA 
for  this  code  since  word  correlation  decoding  gives  the  same  performance  with 
about  the  same  complexity. 

If  we  receive  a  certain  n-tuple  £  and  several  of  the  reliability  numbers 
are  large,  we  may  accept  the  corresponding  hard-decision  outputs  as  correct. 

In  terms  of  the  trellis  this  means  that  at  certain  node  levels  we  extend  the 
trellis  along  only  the  branch  corresponding  to  the  hard-decision  receiver  out¬ 
put.  At  certain  reliable  node  levels,  then,  the  trellis  does  not  grow  in 
width,  since  there  will  be  only  one  branch  emanating  from  each  node.  The  VA 
applied  to  a  sub-trellis  generated  in  this  way  is  equivalent  to  correlating 
the  received  vector  r  against  a  subset  of  the  codewords.  This  set  is  generat¬ 
ed  by  accepting  the  receiver  hard-decision  outputs  for  the  symbols  in  the  k-p 
most  likely  information  positions  and  allowing  all  possible  combinations  in 
the  remaining  symbol  positions.  In  this  way,  the  decoder  correlates 
against  a  set  of  code  words  which  all  agree  with  the  hard-decision  estimate 
in  the  most  likely  k-p  information  symbols  but  otherwise  exhaust 
all  other  combinations  of  digits  in  the  remaining  positions.  The  size  of 
the  subset  of  code  words,  and  hence  the  complexity  of  the  corresponding  subtrellis. 
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will  be  a  function  of  the  number  of  unconstrained  information  symbol  positions 

p.  The  choice  p=k  corresponds  to  true  maximum-likelihood  (ML)  decoding  provid- 
n  v 

ed  2  ,  while  the  opposite  extreme  p=0  corresponds  to  simply  accepting  the 

receiver  hard  decision  outputs  in  the  k  information  positions. 

The  full  algorithm  proceeds  by  first  generating  a  subtrellis  by  fixing  k-p 
of  the  most  reliable  information  symbols.  This  subtrellis  is  then  searched  up 
to  the  k'th  node  level,  extending  the  M  most  likely  paths  at  each  node  level. 
Finally,  after  the  k'th  node  level,  the  VA  is  applied  to  extend  the  surviving 
paths  to  the  final  all  zeros  node  and  select  a  codeword. 

The  parity  check  trellis  for  the  Hamming  (?,H)  code  is  shown  in  Fig.  3.  An 
example  of  a  subtrellis  generated  for  this  code  is  shown  in  Fig.  H.  In  this 
case  we  have  taken  p=2.  The  real  received  vector  r  is  shown  along  with  the  re¬ 
liability  vector  a  and  the  corresponding  receiver  hard  decision  estimate. 


V .  C omputational  Complexity: 


Massey  [8]  has  defined  3  quantities  which  are  useful  in  the  determination 


of  a  trellis  decoder  complexity.  The  number  of  digit  metrics,  to  be  evalu¬ 
ated  in  order  to  compute  path  metrics  is  equal  to  the  total  number  of  digits  on 


all  branches  of  the  trellis.  The  number  of  metric  comparisons,  ,  is  just  the 
number  of  nodes  in  the  trellis  which  have  two  entering  branches.  Finally,  the 


number  of  accumulators,  N_,  is  equal  to  the  maximum  number  of  nodes  or  states 

1  3. 

which  must  be  stored  at  any  given  node  level  as  the  decoder  progresses  through 
the  trellis.  For  standard  maximum  likelihood  deocding  of  a  given  (n,k)  lineat 
block  code,  Massey  has  developed  the  following  bounds  on  these  computational 


parameters : 


and 


N  <2n-k+1  [2k-n+2]-l*  > 

m — 

N  <2n_k  [2k-n+l]-l  , 

Qr~ 


(la) 

(lb) 

(lc) 
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valid"1"  for  k>n-k,  or  equivalently  R=k/n^_l/2. 

It  has  been  demonstrated  in  [8]  that  the  trellis  can  he  simplified 

in  certain  special  cases  and  the  computational  quantities  N  ,  N  ,  N  corres- 

m  c  a 

pondingly  reduced.  One  method  of  simplification  involves  rearranging  the 
columns  of  the  parity  check  matrix  resulting  in  a  nonsystematic  code.  It 
is  not  clear,  however,  that  this  procedure  would  offer  significant  simpli¬ 
fication  in  the  case  of  the  longer,  more  practical  codes.  The  conclusions 
then  is  that  computational  complexity  is  essentially  exponential  in  n-k. 

We  now  examine  how  the  computational  quantities  are  affected  by  the 

trellis  searching  algorithm  presented  here.  Consider  the  parameter  N  . 

a 

When  the  complexity  is  constrained  by  M,  it  is  clear  that  the  number  of 

M 

states  can  never  exceed  2  ,  since  only  M  nodes  are  extended  forward  at 
each  node  level.  When  the  complexity  is  constrained  by  p,  the  trellis  can 
double  in  width  at  most  p  times.  Therefore,  the  number  of  states  can  grow 
to  a  maximum  of  2P.  Taking  into  account  the  original  maximum  likelihood 
bound,  and  again  assuming  k^n-k,  we  have  the  result: 

N  <2  ,  (2) 

a— 

with  y=  min(n-k,  p,  log^M). 

Next,  consider  the  quantity  N  for  the  case  k_n-k.  At  node  levels  k, 
k+1,...,  n  exactly  one  branch  emerges  from  each  node.  Since  all  paths  from 
the  nodes  retained  at  level  k  must  remerge  to  join  at  the  final  node,  the 
number  of  metric  comparisons  in  this  portion  of  the  trellis  will 
be  equal  to  one  less  than  the  number  of  states  present  at  node 
level  k.  Since  we  have  constrained  the  maximum  number  of  states 

t  Similar  but  less  useful  bounds  are  available  for  R<l/2, 
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to  be  less  than  2  ,  the  number  of  metric  comparisons  in  the 

tail  of  the  trellis  can  be  no  greater  than  2Y-1.  Paths  in  the  full  trellis  nay 
also  remerge  at  levels  n-k+l,...,k.  If  we  retain  only  2Y  states  at  any  level, 

•  y_1 

it  is  clear  that  we  can  have  at  most  2  remergences  at  the  next  node  level, 

necessitating  1  metric  comparison  for  each.  At  these  node  levels  then,  the 

number  of  metric  comparisons  does  not  exceed  (2k-n)2  .  The  number  of  metric 

comparisons  within  the  entire  trellis  therefore  satisfies: 

N  <(2Y-l)+(2k-n)Y_1  , 

c — 

or  the  weaker  bound 


Nc<2Y [ 2k-n+l ] -1  .  (3) 

Finally,  we  consider  the  quantity  N  ,  the  number  of  metric  digit  calcula- 

m 

tions.  This  is  simply  the  total  number  of  branches  in  the  reduced  state  trell¬ 
is.  Since  N  is  the  maximum  number  of  accumulators,  the  worst  case  occurs 
a 

when  the  trellis  expands  to  its  full  width  at  the  earliest  node  levels  and 
remains  at  this  width  until  it  is  forced  to  close  in  the  tail  section.  Under 

v 

these  conditions,  if  2  is  the  maximum  number  of  accumulators  or  nodes,  the 
.  i+1 

trellis  will  have  2  branches  traveling  forward  from  nodes  at  level  i  up 
to  node  level  .  At  levels  y,  y+1 , . . .  ,n-y-l  there  will  be  at  most  2.2Y 
branches  traveling  forward  to  the  next  node  level.  Finally,  in  the  tail  at 
levels  n-y,  n-y+1 , . . .  ,n-l  there  will  be  2n  1  branches  traveling  forward  from 
node  level  i  for  i=n-y , n-y+1 ....  ,n-l.  This  gives  a  total  of 

y-1  ...  n-2y  ,,  n-1  .  y  . n-2y 

l  21+1  +  l  2Y+1  +  l  2n_1  =  l  21+1  +  l  2Y+1 

i=0  i=l  i=n-y  i=l  i=l 


-2^-b  ♦  T  2-'*1 
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metric  computations  under  worst  case  conditions.  Assuming  2y<n  then  we  have 


the  bound 

N  <2Y+1[2k-n+2]-U  .  ( U ) 

m— 

Comparing  the  bounds  in  ( 2 )  —  ( U )  with  the  corresponding  results  in  (l)  it 
is  clear  that  the  decoding  complexity  can  be  controlled  through  choice  of  M 
and  p  to  aboid  the  exponential  dependence  upon  (n-k)  associated  with  the  full 
trellis  search.  As  an  example  of  the  use  of  these  bounds  consider  the  situa¬ 
tion  illustrated  previously  in  Fig.  U  for  the  Hamming  (7,*0  code.  A  compari¬ 
son  between  the  actual  computational  complexity  of  the  reduced-search  algor¬ 
ithm^  and  the  preceding  bounds  is  provided  in  Table  1.  Also  included  is  a  com¬ 
parison  between  the  actual  values  for  the  best  nonsystematic  implementation  of 
this  code  using  a  full  trellis  search  (ML  decoding)  and  the  corresponding 
bounds  predicted  from  (l).  We  feel  that  these  bounds  do  provide  a  reasonable 
indication  of  actual  computational  effort  and  indicate  the  potential  computa¬ 
tional  advantages  of  the  reduced-search  trellis  decoding  algorithm  when  appli¬ 
ed  to  larger  more  powerful  codes. 

VI.  Simulation  Results: 

We  have  performed  extensive  simulations  of  the  reduced-search  trellis  al¬ 
gorithm  on  the  R.P.I.  Interactive  Communication  Simulator  ( ICS ) [ 9 ] -  This  is 
an  extensive  hardware-software  system  for  interactive  simulation  of  a  variety 
of  point-to-point  communication  systems.  In  addition  to  the  decoder,  the  other 
components  of  the  system  model  included  a  binary  equiprobable  source,  a  block 


t  It  should  be  noted  that  although  there  are  22  branches  in  the  trellis  of 
Fig.  L,  only  16  branch  metrics  need  be  computed  since  the  branch  metric 
in  the  fixed  positions  are  common  to  all  paths  and  thus  need  not  be  com- 


encoder,  a  BPSK  modulator  and  demodulator  and  an  AWGN  channel.  To  insure 

statistical  accuracy  over  the  range  of  signal  to  noise  ratios  which  were 
7 

simulated,  10  symbols  were  processed  by  the  system  for  each  test  case. 

The  codes  we  chose  for  the  simulations  were  the  Hamming  (7,*0  and  Golay 
(23,12)  perfect  codes  along  with  their  extended  versions,  the  Hamming  (8,M 
and  Golay  (2l*,12).  We  discuss  only  the  results  for  the  extended  codes.  Re¬ 
sults  for  the  perfect  codes  are  similar. 

In  the  first  set  of  simulations  illustrated  in  Fig.'s  5  and  6,  M  was  set 
equal  to  t.he  maximum  width  of  the  trellis.  In  this  case,  no  nodes  are  purged 
from  the  list  at  any  node  level.  At  the  same  time,  p  (the  number  of  uncon¬ 
strained  information  positions)  was  varied  and  a  simulation  was  performed  for 
each  choice _of  this  parameter.  For  the  Hamming  code,  we  have  chosen  p=0,l,2, 
3,1*.  As  indicated  previously,  the  case  p=0  corresponds  to  simply  accepting 
the  receiver  hard  decision  outputs  and  thus  the  upper  curve  in  the  figure 
represents  uncoded  performance.  For  purposes  of  comparison  we  have  also  dis¬ 
played  the  performance  of  a  conventional  hard-decision  algebraic  decoder. 

This  is  a  decoder  whose  estimate  of  the  transmitted  codeword  is  the  code¬ 
word  at  the  minimum  Hamming  distance  from  the  receiver  hard  decision  vector  z. 
Note  that  for  p=2  the  performance  of  the  trellis  decoder  is  about  .  5dB  better 
than  that  of  the  conventional  algebraic  decoder,  whereas  p=3  gives  perform¬ 
ance  nearly  eaual  to  maximum-likelihood. 

For  the  Golay  code,  we  have  chosen  p=0,3,5,7.  The  full  trellis  for  the 
Golay  code  as  too  complex  to  search  within  a  reasonable  time  and  therefore 
the  maximum-likelihood  performance  p=12  is  not  shown.  We  have  included  a  lower 
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bound  on  correlation  decoding  (cf.  [l] )  for  purposes  of  comparison.  Note  that 
for  the  Golay  code,  the  choice  of  p=3  gives  performance  nearly  equal  to  the 
conventional  decoder. 

In  the  second  set  of  simulations  (Fig.'s  7,8)  we  have  set  p  to  its  maxi¬ 
mum  value  and  have  purged  all  but  M  nodes  at  each  node  level.  For  the  Hamming 
code,  we  have  chosen  M=1 ,2 ,8,l6.  For  M=U  the  trellis  decoder  performs  about 
. 8dB  better  than  the  conventional  decoder  and  M=8  gives  essentially  maximum- 
likelhood  performance.  For  the  Golay  code,  we  have  chosen  M=l,U,l6,6!+,128. 

In  this  case,  M=l6  gives  performance  comparable  to  the  conventional  decoder. 

In  the  third  and  final  set  of  simulations  (Fig.'s  9,10)  we  have  set  the 
value  of  M  to  a  fixed  intermediate  value  and  varied  p.  A  simulation  was  again 
performed  for  each  choice  of  p.  For  the  Hamming  code,  we  chose  M=U  and  p=l,2,3. 
For  the  case  p=3,  the  performance  is  essentially  the  same  as  in  Fig.  8  for  M=U. 
For  the  Golay  code,  we  chose  M=6U  and  p=3,5,7,9-  For  p=7  the  performance  is 
about  the  same  as  in  Fig.  8  for  M=64. 

VII.  Summary  and  Conclusions: 

We  have  described  a  general  algorithm  which  performs  a  reduced  complexity 
search  of  an  appropriately  defined  trellis.  This  algorithm  can  be  applied  to 
the  decoding  of  linear  block  codes.  Through  computer  simulation  we  have  demon¬ 
strated  the  tradeoff  between  computational  complexity  and  bit  error  performance 
for  two  such  codes.  Interesting  research  directions  would  be  to  develop  explicit 
performance  bounds  as  a  function  of  M  and  p  and  to  apply  the  algorithm  in  the 


decoding  of  convolutional  codes. 
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Table  1 

Computational  Requirements  for  Soft-Decision 
Trellis  Decoding  of  Hamming  (7,*0  Code. 
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Figure  6 


Performance  of  Reduced- Search  Trellis  Decoder  for  Golay  (2b, 12)  Code 

12 

with  M=2  and  Various  p. 
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Bit  Error  Probability, 


Performance  of  Reduced-Search  Trellis  Xecoder  for  Hamming  (8,14)  Code 

vith  p=k  and  Various  M. 
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Figure  8 

Performance  of  Reduced-Search  Trellis  Decoder  for  Golay 
(2^,12)  Code  with  p=12  and  Various  M. 
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I. 


Introduction : 


The  use  of  Viterbi  decoding  and  sequential  decoding  techniques  to  achieve 
reliable  communications  has  increased  significantly  over  the  past  decade.  As 
a  result  there  has  been  a  commensurate  increase  in  the  need  for  good  convol¬ 
utional  code  constructions.  In  the  past,  much  effort  has  been  expended  in 
finding  optimal,  i.e.,  good  in  some  carefully  defined  sense,  low  rate  R=l/n 
convolutional  codes.  In  general,  convolutional  codes  used  in  conjunction 
with  Viterbi  decoding  should  possess  a  maximum  free  distance  property,  while 
for  codes  used  with  sequential  decoding  the  column  distance  profile  property 
is  important  (cf.  [ 1 ] ) .  Some  of  the  early  optimal  binary  convolutional  codes 
of  practical  interest  in  connection  with  Viterbi  decoding  were  reported  by 
Odenwalder  [2],  and  Larsen  [3].  These  were  short  constraint  length  convol¬ 
utional  codes  with  maximal  free  distance  for  rates  R=l/2,  1/3  and  1/U.  Later, 
binary  convolutional  codes  at  these  same  rates  were  reported  by  Johannesson 
[l],  [U]  possessing  an  optimum  column  distance  profile.  In  each  of  these 
cases  the  code  constructions  were  determined  by  computer  search  methods  due 
to  the  lack  of  any  analytical  procedures  for  obtaining  optimal  rate  R=l/n 
convolutional  codes.  In  particular,  the  codes  reported  by  Larsen  [3]  were 
found  by  a  judicious  choice  of  the  code  generating  polynomials  followed  by 
computer  verification  of  their  free  distance. 

In  many  applications,  bandwidth  restrictions  are  quite  severe  thereby 
dictating  higher  rate  R=b/n  channel  codes  if  effective  error-control  pro¬ 
tection  is  to  be  provided.  Unfortunately,  very  little  progress  has  been  made 
in  finding  good  convolutional  codes  for  arbitrary  rational  rates  greater  than 
1/2.  Early  work  in  developing  the  algebraic  structure  of  high  rate  convolu¬ 
tional  codes  aas  initiated  by  Forney  [5],  [6].  Specifically,  Forney  developed 
a  linear  correspondence  between  the  states  of  a  rate  R=b/n  convolutional 


encoder  with  code  generator  matrix  g  and  the  states  of  a  corresponding  syn- 
t  f 

drome  former  g  .  Here  H  denotes  the  transpose  of  the  generator  matrix 
corresponding  to  an  encoder  of  the  code  dual  to  the  code  generated  by  G. 
However,  Forney's  results  have  stimulated  only  a  very  limited  number  of 
rational  rate  code  construction  investigations.  Generally,  these  investiga¬ 
tions  have  centered  on  code  rates  of  the  form  R=(n-l)/n.  Paaske  [ T 3  has  pro¬ 
vided  tabulations  of  short  constraint  length  binary  convolutional  codes  with 
maximal  free  distance  only  for  rates  2/3  and  3/k.  Furthermore,  Paaske' s  re¬ 
sults  were  obtained  by  computer  search  procedures  due  to  the  unavailability 
of  strictly  analytic  methods.  Results  of  a  computer  search  for  systematic 
convolutional  codes  with  rates  R=(n-l)/n  for  n=3, 8  have  been  given  by 
Hagenauer  [8].  These  codes,  which  are  used  in  conjunction  with  sequential 
decoding,  are  good  in  the  sense  that  they  maximize  a  column  distance  function. 
Another  class  of  high  rate  R=(n-l)/n  convolutional  codes  have  been  reported 
by  Wu  [9],  These  codes  are  convolutional  self-orthogonal  codes  (CSOC's) 
which  are  known  to  be  threshold  decodable.  Wu  has  provided  extensive  tab¬ 
ulations  of  CSOC's  for  n=3,1», . . .  jl1*. 

A  class  of  punctured  convolutional  codes  of  rate  R=(n-l)/n  have  been 
investigated  by  Cain,  et  al.  [10]  using  computer  searches.  This  class  of 
codes  is  of  interest  since  the  implementation  of  Viterbi  decoders  for  high 
rate  convolutional  codes  is  greatly  simplified  if  the  code  structure  is  that 
of  a  punctured  low  rate  code.  As  an  example,  suppose  a  rate  R=l/2  code  is 
used  with  every  fourth  encoder  output  bit  deleted.  This  code  will  produce 
three  channel  bits  for  every  two  information  bits,  i.e.,  it  will  be  a  R=2/3 
code.  Hence,  the  R=2/3  code  was  constructed  by  periodically  deleting  bits 
from  a  R=l/2  code,  or  by  puncturing  the  code.  The  advantage  is  that,  after 


appropriate  modification  of  the  received  bit  stream,  the  Viterbi  decoder 
can  be  implemented  as  a  R=l/2  decoder  requiring  only  binary  comparisions 
at  each  decoder  state.  One  disadvantage  is  that  by  puncturing  a  R=l/2  code 
its  free  distance  is  reduced.  We  expect  this  to  be  the  case,  however,  since 
high  rate  convolutional  codes  generally  have  smaller  free  distance  profiles 
than  low  rate  R=l/n  codes.  The  free  distance  df  of  a  rate  R=(n-l)/n  con¬ 
volutional  code  obtained  by  puncturing  a  code  of  rate  R=l/n  is  almost  always 
equal  to  the  bound  for  standard  convolutional  codes.  However,  in  some  cases 
df  for  a  punctured  code  is  less  than  df  for  a  standard  code  at  the  same  rate 
and  constraint  length. 

Recently,  a  new  class  of  time-invariant  binary  convolutional  codes  have 
been  defined.  Lauer  [ll]  describes  the  construction  of  some  optimal  partial- 
unit-memory  (PUM)  codes.  Here  the  encoders,  at  rational  rates  R=b/n,  do  not 
require  full  storage  of  the  previous  subblock  of  b  information  bits.  The 
codes  are  optimal  in  the  sense  of  having  maximum  free  distance  for  a  given 
rate  and  constraint  length.  Unfortunately,  code  constructions  for  ration¬ 
al  rates  are  only  provided  for  two  R=2/3  codes. 

The  purpose  of  this  paper  is  to  present  good  rational  rate  convolution¬ 
al  code  constructions  for  rates  R=(n-k)/n,  k=l,2, . . . ,n-l  with  n=2,3,...,8 
and  constraint  lengths  K=3,U,...,8.  A  tabulation  of  codes  at  these  rates 
will  significantly  extend  the  existing  results  of  rates  2/3  and  3A  for 
standard  convolutional  codes  [7].  The  rational  rate  codes  tabulated  here¬ 
in  are  short  constraint  length,  nonsystematic ,  binary  convolutional  codes 
with  maximum  free  distance  for  use  with  Viterbi  decoding. 


II.  Preliminaries: 


Figure  1  illustrates  a  typical  nonsystematic  binary  convolutional  encoder. 
Here  a  convolutional  code  with  constraint  length  K  and  rate  P.-b/n  is  represent¬ 
ed  as  a  shift  register  of  length  K  coupled  with  n  multi-input  modulo-2  adders. 

A  group  of  b  information  bits  are  shifted  into  the  shift  register  and  the  out¬ 
puts  of  the  n  adders  are  sampled  and  transmitted  sequentially.  The  code  it¬ 
self  is  determined  by  the  connections  between  the  shift  register  stages  and 
the  modulo-2  adders.  All  codes  in  the  sequel  will  be  specified  by  n  connect¬ 
ion  vectors,  one  for  each  adder,  and  denoted  £  ,£g , . . .  The  j'th  component 

of  is  a  1  if  stage  j  of  the  shift  register  is  connected  to  the  i'th  adder, 
otherwise  the  j'th  component  is  zero.  The  code  connection  vectors  are  usual¬ 
ly  given  in  octal  notation  and  are  right-justified. 

The  criterion  of  goodness  employed  here  is  that  of  maximizing  the  free 
distance  of  the  code  at  a  specified  rate  and  constraint  length.  Hence,  it 
is  desirable  to  make  use  of  an  upper  bound  on  the  free  distance,  or  unre¬ 
stricted  minimum  distance,  of  a  confolutional  code.  For  rate  R=l/n  codes  an 
upper  bound  has  been  given  by  Heller  [12].  This  bound  has  been  obtained  by 
utilizing  the  fact  that  a  terminated  binary  convolutional  codes  with  L  inform¬ 
ation  bits  is  a  group  code.  For  arbitrarily  large  L  the  bound  is  defined 
accoridng  to 

2*"1 

d  <min  — - —  (K+£-l)n 
1<*  _  2  -1 

where  [x]  deontes  the  largest  integer  less  than  or  equal  of  x,  and  K  is  the 
code  constraint  length.  Odenwalder  [2]  has  shown  that  if  dR  is  odd,  then  the 
bound  of  (l)  can  sometimes  be  improved.  In  particular,  it  has  been  shown 
that  if  (l)  is  optimized  with  £=h  and  the  inequality  given  by 
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d„  <  £—  {n(h+K-l )-l+21-n} 
H  "  2n-l 


is  not  satisfied,  then  d„  may  he  decreased  by  one.  That  is, 

af  i  V1  •  (3) 

The  improved  upper  bound.  has  been  found  to  be  very  tight  at  small  constraint 
lengths . 

A  modified  version  of  the  Heller  bound  (l)  is  used  to  upper  bound  the 
free  distance  of  a  rational  rate  R=b/n  convolutional  code.  In  terms  of  the 
code  parameters  b,  n  and  K  the  bound  is  given  by 

d  <_  min  ^ - (K+i-b)r-  =d^  (b) 

f  I<iL21-l  DJ  ^ 

vhere,  [xj  again  denotes  the  largest  integer  less  than  or  equal  to  x  and 

'  1  if  K<2b-1 

1=  ■  (5) 

b  if  K>2b-1  . 

The  two  conditions  on  I  described  in  (5)  have  been  found  to  be  necessary 
because  when  the  shift  register  length  K  is  not  large  enough  to  retain  a 
group  of  b  information  bits  for  more  than  one  shift  cycle  the  distance 
properties  of  the  code  are  significantly  reduced.  If  d^  is  odd,  the  bound 
in  (M  can  sometimes  be  improved  in  a  fashion  similar  to  that  described 
earlier.  If  (M  is  optimized  with  i=h  and  the  inequality  now  given  by 


d^  <  ——  {^{h+K-b)-l+21  h}  (6) 

is  not  satisfied,  then  d^  may  be  decreased  by  one.  That  is,  for  rate  R=b/n 
binary  convolutional  codes 
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(7) 


^V1  • 

The  bounds  described  above  will  be  used  exclusively  in  the  search  for  good 
codes. 

Ill .  Code  Search  Procedure: 

The  procedure  used  to  find  new  rate  R-b/n  codes  is  an  extension  of  the 
search  technique  outlined  by  Odenwalder  [2]  for  rate  R=l/n  codes.  This  pro¬ 
cedure  is  best  described  in  terms  of  a  state  diagram  and  code  trellis  approach 
(cf.  [13]  for  details).  The  convolutional  encoder  of  Fig.  1  can  be  considered 
a  finite-state  machine  with  2  -  states.  Hence,  a  rational  rate  Viterbit  decod¬ 
er  will  have  2K_b  states.  Furthermore,  b  information  bits  give  rise  to  2^-1 
possible  non-zero  input  patterns  which  may  be  input  to  the  shift  register.  In 
what  follows  we  will  assume  that  the  all-zero  information  sequence  is  encoded 
and  transmitted  across  a  perfect  channel  with  a  rational  rate  Viterbi  decoder 
being  used  to  decode  the  received  data.  The  search  strategy  will  be  to  find 
the  minimum  weight  (Hamming  distance)  of  any  path  through  the  code  trellis 
which  diverges  from  the  all-zero  path  at  some  point  and  remerges  at  any  le.ter 
time.  Such  paths  are  called  adversaries,  wherein  the  minimum  weight  of  these 
adversaries  is  simply  the  minimum  free  distance  dv  of  the  code. 

In  the  case  of  rate  R=l/n  codes,  the  search  process  was  started  by  re¬ 
stricting  the  first  decoded  bit  to  be  a  one.  Then  at  any  time  i>K  the  Viterbi 

K— 1 

decoder  will  retain  2  paths.  Each  path  will  be  the  minimum  weight  path 
which  diverges  from  the  all-zero  state  at  some  point  and  terminates  in  a  part¬ 
icular  state  at  time  i.  For  large  enough  i  the  minimum  free  distance  of  the 
code  is  equal  to  the  weight  of  the  path  which  terminates  at  the  all-zero 
state.  Additional  information  such  as  1. )  the  number  of  such  minimum  weight 
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paths  (adversaries)  which  terminate  in  each  state  at  each  level  in  the  trellis, 
and  2.)  the  number  of  bit  errors  that  these  paths  represent  is  also  col_ected. 
For  a  rational  rate  R=b/n  code  the  above  search  strategy  is  employed, 


however,  the  search  for  the  minimum  free  distance  must  be  done  for  each  of 


the  2  -1  possible  non-zero  input  information  bit  patterns.  Hence,  a  total 
of  2^-1  minimum  free  distance  paths  through  the  trellis  must  be  found.  A 
sequence  of  free  distances  {d^j^,  d^,  |  2 » •  •  •  ,d^,  |  ^),  one  for  each  non-zero 

input  pattern,  is  found  for  a  rate  R=b/n  code.  By  definition  the  free 
distance  df  of  the  code  is  taken  to  be  the  minimum  over  all  possible  input 


patterns,  i.e. 


df  =  min  {dfh,  df|2,...,df|2b_1>  . 


(8) 


It  is  particularly  illuminating  at  this  point  to  consider  a  state 
diagram  interpretation.  For  example,  Fig.  2  illustrates  the  state  diagram 
for  a  R=2/3,  convolutional  code  with  a  specified  set  of  connection 
vectors.  There  are  four  decoder  states  and  three  possible  non-zero  input 
bit  patterns.  Starting  from  the  00  state,  the  first  input  bit  pattern  01 
leaves  the  00  state.  There  are  two  minimum  weight  paths  at  distance  5  with 
a  total  of  three  bit  errors.  However,  for  input  pattern  11  there  is  one 
oath  at  a  minimum  weight  of  3-  Hence,  by  (5)  the  free  distance  of  the  code 


in  Fig.  2  is  0^=3. 

A  modified  rational  rate  Viterbi  decoding  algorithm  constructed  by  Wismer 
[lU]  was  used  to  discover  good  rational  rate  convolutional  codes.  A  limited 
commuter  search  of  rational  rate  codes  vas  performed  to  obtain  good  code  con¬ 
structions,  i.e.,  code  connection  vectors  (CCV's),  for  specified  rates  R  and 
constraint  lengths  K.  The  search  procedure  is  outlined  as  follows: 
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Step  1 .  ) 

Step  2.  ) 
Step  3 .  ) 


Step  U . ) 


Step  5* ) 


Step  6. ) 


Choose  rate  R=b/n  and  constraint  length  K.  Obtain  a  bound  on 
the  free  distance  d^  of  the  code  from  (b)  and  the  sequel. 

Set  the  CCV's  {g^ ,§£ , • • • ,gn)  t0  some  initial  values. 

Apply  the  algorithm  and  obtain  the  sequence  of  free  distances 
{df |l’df|2*..»df|2b_1}  ,  the  number  of  adversary  paths  and  the 
corresponding  number  of  bit  errors  in  the  paths. 

Iterate  on  the  CCV's  collecting  all  codes  which  meed  the  bound 
for  df . 

Eliminate  all  catastrophic  codes  from  the  set  of  candidate  codes. 
Here  a  code  is  catastrophic  (cf.  [ 15 ] )  if,  in  terms  of  its  state 
diagram,  there  is  a  closed-loop  zero-weight  path  from  some  non¬ 
zero  state  back  to  itself. 

Choose  the  set  of  codes  (CCV's)  with  the  highest  average  value 
df  amojng  all  2b-l  input  patterns;  here  we  define 


2b-l 


df  = 


2b-l 


i=l 


(9) 


Step  J.)  If  the  code  set  of  Step  6  contains  more  than  one  code,  only 

those  codes  with  the  smallest  number  of  bit  errors  among  their 
adversaries  at  the  minimum  distance  plus  one  were  retained.  If 
more  than  one  code  still  remained,  the  adversaries  at  the  minimum 
distance  plus  two  were  retained,  etc. 

The  codes  determined  in  this  fashion  are  the  best  code  for  that  rate  and 
constraint  length  in  that  the  free  distance  has  been  maximized  and  the  number 
of  bit  errors  in  the  adversaries  have  been  minimized.  Results  of  computer 
searches  for  codes  at  rates  R=(n-k)/n,  k=l ,2 , . . .  ,n-l  with  n=2,3,...,8  and 
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for  constraint  lengths  K=3, 8  are  tabulated  in  Tables  1-18. 


Results  for  low  rate  codes  R=l/5 ,1/6 ,1/7  and  1/8  indicate  that  these 
codes  have  good  error-correcting  capability  due  to  their  very  large  free  dis¬ 
tances  (cf.  [l6]).  These  explicit  low  rate  code  tabulations  are  useful  in 
that  they  do  not  appear  in  the  literature.  Results  for  higher  rate  codes  in- 

4- 

dicate  that  the  free  distance  is  markedly  inferior  to  the  lower  rate  codes ' . 
Increasing  the  constraint  length  K  beyond  8  will  improve  the  distance  profile 
of  these  codes,  however,  the  complexity  of  the  search  procedure  increases  ex¬ 
ponentially  with  K.  It  is  noted  that  several  rate  2/3  and  3A  codes  have  beei 
found  which  have  a  higher  df  than  those  codes  reported  by  Paaske  [?]. 

IV.  Performance  Evaluation: 

Bit  error  probability  performance  bounds  for  rational  rate  convolutional 
codes  can  be  obtained  using  a  union  bounding  technique  (cf.  [ 13 ]  for  details) 
The  bound  for  rate  R=l/n  codes  on  memoryless  channels  can  be  expressed  in 
terms  of  the  code  generating  function  T(D,N)  as 


P.<K  dTiD^Nl 
b-  0  dN 


(10) 


N=1 ,D=Dr 


where  the  constants  KQ  and  depend  upon  the  particular  code  employed,  the 
modulation  strategy  in  use  and  the  channel  parameters.  Evaluation  of  the 
derivative  of  the  code  generating  function  is  given  by  the  power  series 
expansion 

„k 


dT(N,D) 


dN 


=  I  <y> 

N=1  k=df 


(11) 


where  c^  represents  the  total  number  of  bit  errors  in  all  the  adversaries 


+  Recall  that  for  rate  R=b/n  codes  df  is  defined  according  to  (8)  which  is 
a  rather  pessimistic  indication  of  the  code  error  correcting  capability. 
For  many  input  patterns  d„i .  can  be  much  larger  than  d_. 


vhich  differ  in  exactly  k  symbol  positions  from  the  correct  path.  For  a 


rate  P=b/n  code,  one  branch  corresponds  to  b  information  bits  and  the  bit 
error  probability  is  then 


I 


m=l 


P' ,  P 
b  m  m 


(12) 


vhere  P'  i  is  the  conditional  probability  of  a  branch  error  given  incut 
b  |m 

pattern  m,  ?_  is  the  corresponding  a  priori  probability,  and  M=2°-l  correspond 

ing  to  the  number  of  non-zero  input  patterns.  Each  of  these  patterns  give 

rise  to  a  minimum  distance  d  >  and  a  set  of  coefficients  c.i  so  that, 

f  |m  1  [m 

corresponding  to  (10)  and  (ll),  ve  have  the  bound 


ao 


p ’ ,  <  K  T  c  i  D* 

b , m  —  0  _  ,  c£|m  0 

T  |m 


(13) 


It  follows  that  the  bit  error  probability  can  be  bounded  according  to 


1  M  oo  »  . 

P"o  ~  b  K0  ^i  |  .^d  C£  |m^0  f  Pm  * 

m-i  |£-df. 


(lM 


where  it  is  usually  assumed  that  Pm=l/(2  -l),  i.e.,  that  input  patterns  to  the 


encoder  are  equally  likely.  For  the  particular  case  of  BPSK  modulation  on  the 
AWGN  channel  the  constants  are  given  as 


and 


(15a) 

(I5b) 


where  E,  /N„  is  the  energy  per  information  bit  normalized  to  the  single-sided 
o  0 

noise  spectral  density  Nn  watts/Hz. 


Code  performance  evaluations  are  provided  in  Table  19  for  selected 
rational-rate  convolutional  codes  at  various  channel  signal -to-noise  ratios 
(SNR)  of  practical  interest.  Table  19  provides  both  the  computed  bound  (ih) 
on  bit  error  probability  P  ,  and  that  obtained  by  simulation  using  a  rational 
rate  Viterbi  decoding  algorithm  together  with  BPSK  modulation  on  the  AWGN 
channel.  The  matched  filter  outputs  were  uniformly  quantized  to  3  bit  accur¬ 
acy.  Results  indicate  performance  within  0.2-0. 5d£  of  the  bounds  which,  of 
course,  assume  infinitely  fine  quantization.  This  is  entirely  consistent  with 
simulation  results  described  in  [l6]. 

V.  Summary  and  Conclusions: 

Tabulations  of  selected  rational  rate  convolutional  code  constructions 
have  been  provided.  These  binary  short  constraint  length  codes  were  obtained 
by  a  limited  computer  search  using  a  modified  rational  rate  Viterbi  decoding 
algorithm.  The  codes  are  best  in  the  sense  that,  for  a  given  rate  and  con¬ 
straint  length,  the  reported  code  possesses  the  maximum  free  distance  profile 
and  the  minimum  number  of  adversaries  in  the  weight  spectrum.  An  explicit 
expression  has  been  provided  for  computing  bounds  on  the  bit  error  probabil¬ 
ity  directly  from  the  weight  spectrum  of  a  specific  code  and  the  channel  para¬ 
meters.  We  have  shown  through  simulation  that  this  bound  is  quite  useful  in 
describing  code  performance. 

The  codes  reported  herein  have  immediate  application  in  the  study  of  com¬ 
bined  source-channel  coding  in  the  context  of  image  transmission  [17],  [l8] . 

In  addition,  these  codes  are  expected  to  have  a  number  of  applications  in 
coding  for  burst  or  impulsive  noise  channels  [19]. 
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Rate  R=b/n  Binary  Convolutional  Encoder 
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Input  Pattern 

Minimum  Distance 
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Figure  2 

State  Diagraffi  for  R=2/3,  K=U  Convolutional  Code 


Code  ( Octal ) 


rree  Distance 


7  Codes  found  "by  Paaske  [  7  ] . 


i achieved 


Table  1 

Best  Rate  2/3  Convolutional  Cedes 


bound 


3 

6 

i 

2  2 

IT 

06 

15  3 

33 

15 

22  1 

I 

27 

75 

72  j  5 

177 

055 

112  |  6 

236 

155 

337  |  7 

i 

Constraint 
Length.  K 


Code  (Octal) 


achieved 


bound 


13 

15 

02 

ll* 

1 

i  2 

1 

36 

ll* 

32 

07 

!  3 

13 

25 

61 

1*7 

1 

!  1* 

127 

0U5 

106 

172 

1 

1* 

0U5 

121 

216 

357 

;  5 

» 

\ 

Table  2 

»st  Rate  3/*1  Convolutional  Codes 


Constraint 
Lenctfc.  Z. 


Code  (Octal) 


acnievec 


e 


bound 


1* 

17 

03 

06 

12 

Ok 

5 

37 

15 

26 

13 

3k 

6 

35 

23 

75 

6l 

k7 

1 

7  1 

|  03k 

127 

0k5 

106 

172 

8 

33^ 

Okk 

12k 

216 

357 

Table  5 

Best  Rate  3/5  Convolution 


Constraint 

Lenrth,  K 

Code 

(Octal) 

5 

13 

Ok 

11 

26 

36 

6 

67 

15 

26 

52 

57 

f 

013 

023 

056 

132 

17 k 

8 *■ 

237 

27k 

156 

255 

337 

Free  Distance 


achieved 


All  d^=k  codes  found  were  catastrophic. 


Table  6 

Best  Rate  b/5  Convolutional  Codes 


Table  7 

Best  Rate  1/6  Convolutional  Cedes 


Code  (Octal) 


!  Free  Distance 
!  l 

I  achieved  !  Bound 


6 

55  05  11 

!  I 

2 

2 

20  02  76 

7 

025  071  123 

2 

3 

0U6  111  175 

i 

8 

3^2  362  065 

3 

116  213  377 

Constraint 
Lenrth.  K 


Code  (Octal) 


Free  Distance 


acmevea 


Table  11 

Best  Rate  3/7  Convolutional  Codes 


bound 


1* 


Code  (Octal) 


Free  Distance 


acmevec 


bound 


13 

07 

31 

23 

25 

35 

37 

11 

lU 

15 

67 

26 

52 

57 

115 

010 

013 

023 

056 

132 

171* 

130 

067 

237 

27^ 

156 

255 

337 

Constraint 

Lenpth,  K 

Code  (Octal) 

Free  Di 

.stance 

achieved 

bound 

• 

6 

36 

20 

05 

55 

2 

2 

02 

11 

76 

7 

025 

111 

172 

11*7 

3 

3 

156 

122 

177 

6 

026 

031* 

270 

065 

1* 

It 

116 

213 

377 

Table  12 

Best  Rate  5/7  Convolutional  Codes 


Constraint 

Lenpth.  K 

Code  ( 0c 

tal) 

Free  D: 

.stance 

m 

7 

003 

005 

010 

021 

2 

2 

01*1 

101 

176 

P 

207 

111 

123 

252 

2 

3 

31*5 

26l 

336 

Table  lU 

Best  Rate  6/7  Convolutional  Codes 


Code  (Octal) 


ance 


Length.  K  1 ! achieved  1  bound 


3 

7  7  5  5 

5  7  7  7 

21 

21 

1* 

17  17  13  13 

13  15  15  17 

26 

26 

5 

37  33  25  25 

35  33  27  37 

32 

32 

6 

57  73  51  65 

75  1*7  67  5.7 

36 

36 

7 

153  111  165  173 

135  135  11*7  137 

lo 

1*0 

8 

275  275  253  371 

331  235  313  357 

1*5 

1*5 

Table  15 

Best  Rate  1/8  Convolutional  Codes 


"^Partial  Searcn  Table  l6 


Best  Rate  3/8  Convolutional  Cedes 
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Constraint 

Length,  K 

Code  (Octal) 

Free  Distance 

achieved 

bound 

i  .  ■  - 

6 

53 

60 

02 

55 

3 

05 

11 

76 

36 

7 

076 

025 

122 

111 

i* 

l 

056 

172 

11*7 

177 

8 

320 

026 

213 

031* 

5 

5 

116 

270 

065 

377 

Table  1? 

Best  Rate  5/8  Convolutional  Codes 


F.=2/3,  K=l* 
Code 


T  />T 

*v“o 


Computed 
Bound '  ,  P 


7.39x10" 


I.5^xl0~l 


Simulation 


R=3/l*,  K=6 

Code 


Computed 
Bound1",  P. 

D 


Simulation 
Result,  P, 


R-l/5 ,  K=l* 
Code 


1.03xl0-2 

2.59x10“- 

5.1*6x10“" 

V,;o 

3dS 

idB 

•  5dB 

3. 53X10--3 

2.35x10“^  • 

1.15xl0“5 

1.80xl0“2 

2.91x10" 3 

-L 

1.72x10 

V*'o 

3dB 

idB 

5d3 

1. 50x10“ 3 

_L 

2.00x10 

l.oOxlO--7 

2.11xl0-3 

3 

1* .  08xl0-li 

2.53xl0-5 

Bound  ,  P. 

D 


Simulation 
Result,  P 


f  Computed  Bound  Evaluated  from  Eon.  (l**)»with  P  =1/(2  -l) 

m 
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Performance  Evaluation  for  Selected  Rational  Pane 
Convolutional  Codes;  Viterbi  Decoding  on  an  AVGIT 
Channel  Employing  a  BPSK  Modulation  Strategy. 


PART  IV 

MODELING  AND  ANALYSIS  OF  SELECTED  RECEIVER  PERFORMANCE 


IN  IMPULSIVE  NOISE  CHANNELS 


I.  Introduction: 


A  considerable  body  of  knowledge  exists  concerning  the  performance  of 
digital  communication  systems  operating  on  the  additive  white  Gaussian 
noise  (AWGN)  channel  (cf.  [l]-[U]  for  comprehensive  treatment).  The  major 
impetus  for  this  concentration  on  the  AWGN  channel  has  been  the  justi¬ 
fication  of  the  Gaussian  model  on  physical  grounds  for  a  large  number  of 
important  applications;  the  deep  space  channel  most  notably.  No  doubt,  the 
comprehensive  coverage  in  this  case  has  been  made  possible  by  the  analytical 
tractability  of  the  AWGN  model.  In  an  increasing  number  of  important  appli¬ 
cations,  however,  the  Gaussian  model  provides  an  entirely  inappropriate 
description  of  the  prevailing  noise  environment.  A  number  of  such  appli¬ 
cations  include:  communication  systems  operating  in  the  ELF/VLF  frequency 
range;  data  transmission  on  the  switched  telephone  network;  wideband  FM 
systems  operating  close  to  or  below  threshold;  VHF/UHF  communication  systems 
operating  in  the  vicinity  of  large  metropolitan  areas;  HF  and  Troposcatter 
long-distance  communication  systems;  and  military  communication  systems  sub¬ 
ject  to  wideband  jamming. 

VThile  the  noise  producing  mechanism  is  decidedly  different  in  each  of 
the  above  examples,  in  each  case  the  noise  environment  can  be  described  as 
a  linear  combination  of  a  Gaussian  and  an  impulsive  noise  process.  This  is 
a  fairly  interesting  class  of  random  processes  and,  as  indicated  above, 
rather  rich  in  practical  applications.  Furthermore,  there  is  ample  justifi¬ 
cation  for  this  model  on  both  physical  and  mathematical  grounds.  Finally, 
with  some  assumptions  on  the  statistics  of  the  impulse  process,  this  class 
of  random  processes  possesses  a  certain  degree  of  analytical  tractability. 


This  paper  then  is  concerned  vith  the  modeling  and  analysis  of  the 
error  probability  performance  of  selected  modulation  strategies  operating 
in  narrowband  impulsive  noise.  A  generalized  impulsive  noise  model  is 
described  which  consists  of  a  linear  combination  of  a  narrowband  shot  noise 
process  and  AWGN.  The  low-density  shot  noise  is  generated  by  exciting  a 
fixed  linear  dynamical  system  by  a  stationary  point  process  (s.p.p.)  de¬ 
scribed  in  terms  of  its  interarrival  distribution.  We  consider  the  part¬ 
icular  case  of  Gamma  distributed  interarrival  times.  Some  of  the  low-order 
statistical  properties  of  this  model  are  described  in  conveniently  parameter¬ 
ized  form  which  allows  calibration  to  a  wide  variety  of  applications,  such 
as  described  above. 

The  receiver  structures  considered  consist  of  a  conventional  data  demod¬ 
ulator,  implemented  as  a  linear  matched  filter,  preceded  by  nonlinear  front- 
end  processing.  Performance  results  are  described  for  several  modulation 
strategies  including  binary  coherent  phase-shift  keying  (BPSK),  differentially 
coherent  phase-shift  keying  (DPSK),  quadrature  phase-shift  keying  (QPSK), 
binary  noncoherent  frequency-shift  keying  ( BFSK ) ,  and  coherent  minimum  shift 
keying  (MSK).  These  results  are  carefully  parameterized  by  the  channel  and 
receiver  characteristics.  Analytical  results  are  provided  where  possible, 
otherwise  we  have  made  extensive  use  cf  computer  simulation  results  obtained 
on  the  R.P.I.  Interactive  Communications  Simulator  (ICS)  as  described  in  [5]. 
This  is  an  extensive  hardware/software  system  allowing  realistic  and  flexible 
simulation  of  a  wide  variety  of  point-to-point  digital  communication  links. 

This  paper  is  organized  as  follows:  We  describe  the  channel  signal  and 
noise  model  in  Section  II,  followed  by  a  treatment  of  selected  low-order 
statistical  properties  of  the  noise  model  in  Section  III.  This  latter 


information  is  useful  in  calibrating  the  assumed  noise  model  against  real- 
world  impulsive  noise  situations.  We  do  not  attempt  such  a  calibration 
here  but  rather  provide  a  careful  parameterization  of  the  problem  which 
hopefully  can  be  applied  to  a  wide  variety  of  impulsive  noise  environments. 

A  comprehensive  treatment  of  the  performance  of  linear  matched  filter  re¬ 
ception  in  impulsive  noise  is  provided  in  Section  IV.  These  results  illus¬ 
trate  both  quantitatively  and  qualitatively  the  severe  degradation  in  per¬ 
formance  due  to  small  amounts  of  impulsive  noise.  In  Section  V  we  demon¬ 
strate  the  effectiveness  of  selected  nonliqear  receivers  In  this  environ¬ 
ment.  The  receiver  structures  here  consist  of  linear  front-end  processing 
together  with  selected  zero-memory  nonlinear  (ZMNL). devices  motivated  by  opti¬ 
mum  weal;  signal  results.  A  summary  and  conclusions  can  be  found  in  Section  VI 
II.  Channel  Signal  and  Hoise  Model; 

In  what  follows  we  make  extensive  use  of  complex  narrowband  representa¬ 
tions  of  all  signal  and  noise  processes.  Specifically,  the  received  signal 
is  given  by 

r(t)  =  Re{?(t)eJ2llfct}  ,  (l) 

where  f  is  an  assumed  known  carrier  frequency  in  Kz,  and  f(t)=r  (t)-Jr  (t) 
is  the  complex  envelope  expressed  in  terms  of  lowpass  waveforms  rc(t)  and 
rs(t)  representing  the  inphase  and  quadrature  (I/Q)  components,  respectively. 
The  complex  envelope  ?(t)  is  furthermore  assumed  of  the  form 

?(t)  =  §(t)  +  fi(t)  ,  (2) 


where  §(t)  is  the  complex  envelope  of  the  received  signal  component  and  fi(t) 
is  a  complex  noise  process  representing  the  non-Gaussian  impulsive  noise 
environment.  We  consider  each  of  these  components  separately. 


A.  Cho.fl.ncJL  S-Lq txoj,  ModcJL:  Ve  assume  the  signal  energy  is  giver,  by 


|  s(t ) | 2dt  , 


where  Tg  is  the  channel  symbol  (baud)  interval.  Furthermore,  ve  restrict 
attention  to  digital  signaling  formats  such  that  s(t)  has  the  representation 


§(t)  l  fCl,iuc(t_2kiTs'T)+JC2,ius(t~2iliTs“T)1 


e*6  .  (li) 


Here,  depending  upon  the  modulation  strategy  emnloved,  u  (t)  and  u  (t)  are 

c  s 

appropriately  defined  baseband  waveforms  modulating  the  I/Q  rails.  Similar- 
ly,  the  quantities  C..  .  ,  C0  .  depend  uoon  the  binary  data  seauence  {x.} 
to  be  transmitted  in  a  manner  specific  to  the  modulation  strategy  in  use. 

The  quantity  k  in  (M  assumes  the  values  0  or  1,  with  the  choice  k=l  gener¬ 
ally  corresponding  to  the  case  of  staggered  overlapping  baseband  waveforms 
as  in  MSK.  Finally,  the  quantities  t  and  6  in  (M  are,  in  general,  random 
timing  epoch  and  phase  offset,  respectively.  In  what  follows  wr  neglect  the 
effects  of  symbol  synchronization  and  phase  tracking  errors,  although  it  is 
expected  that  in  practice  these  effects  will  have  a  profound  influence  on 
overall  system  performance. 

Some  specific  examples  of  digital  modulation  formats  which  can  be  ex¬ 
pressed  in  the  form  (k)  are  provided  below.  A  summary  of  such  representa¬ 
tions  is  given  in  Table  1. 

CohCAZflX.  BFSfC:  Here  the  complex  signal  envelope  is  of  the  form 

S(t)  =J~^  I  x.uo(t-iTs-T)eJ0  ,  (5) 

t  We  specifically  restrict  attention  to  equal  energy  signaling  alphabets, 
tt  In  what  follows  we  assume  x^=±l. 


t  . 


where  the  binary  data  sequence  {x^  assumes  values  ±1,  and  uQ(t)  is  the 
unit  pulse  waveform 


;  -T  /2<t<T  /2 
s - s 

;  elsewhere. 


(6) 


Thus,  s(t)  in  this  case  is  of  the  form  (k)  with  C  .=x.,C0  =0,k=0,  and 

u  (t)=u.(t)  while  u  (t)=0. 
c  0  s 

Noncoh&i&ni  BF SK:  In  this  case,  again  assuming  x.=±l,  the  complex  signal 
envelope  is  given  by 

s(t)  =  /t^  £  [uc(t-iTs-T)+,3xius(t-iTs-T)]e'38i  ,  (7) 

»  is  i 

where1  { 6^ }  is  an  independent  and  identically  distributed  (i.i.a.)  sequence 
uniformly  distributed  over  [-11,11].  The  baseband  modulating  waveforms  are 
now  given  by 

u  (t)  =  cos(Au>t/2)u  (t),  (8a) 

c  u 

and 

u  (t)=sin(hut/2)un(t)  ,  (8b) 

s  u 

with  ^(t)  described  by  (6).  Here.  Aw  is  the  tone  spacing  in  radians/second, 
and  is  a  multiple  of  2ir/Ts  to  insure  orthogonality.  The  instantaneous  radian 
frequency  is  then  2rfc±Aw/2  during  successive  baud  intervals.  Again, §(t)  is 
of  the  form  (1*)  with  C.  .«eJ6i  ,  C0  .=x.e36i  ,  k«0,  and  u  (t),  u  (t)  defined 

by  (8). 

Coherent  QPSK:  The  complex  envelope  of  the  transmitted  signal  is  now  of  the 


§(t)  =/-?!  tx?-iUo(t-iVT)+Jx2i+lUs(t‘iVT)]e 


t  Without  the  sequence  {6,},  as  described  here,  the  phase  could  be  estimated 
on  the  basis  of  past  transmissions  thereby  violating  the  assumption  of  non' 
coherent  reception. 
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•where  in  this  case 


i 


uc(t)=us(t)=(i/v^)u0(t)  . 


(10) 


Note  the  factor  2^,  where  is  the  energy  per  hit,  since  now  two  symbols 
are  transmitted  for  each  channel  use.  The  QPSK  signal  can  be  considered 
the  sum  of  two  independent  BPSK  signals  in  quadrature.  Clearly,  s(t)  is 
of  the  form  (1*)  with  E^E^,  C^x^  ,  C2>i**jc2i+1,  k=0,  and  uc(t),  ug(t) 
given  by  (10). 

Coh&XtnZ  M SK:  The  complex  envelope  of  the  KSK  signal  can  be  represented 

in  the  form 

(11) 


(t)  =Jt1  l  [x2iUc(t-2iVT)+jX2i+luS(t-2iVT)^ 


je 


s  1 


2i+l  s 


where  now 


ufi(t)  =  cos(7:t/2Ts)u^(t)  ,  (12a) 

and 

u  (t)  =  sin(irt/2T  )ui(t-T  )  ,  (12b) 

c»  S  U  S 

with  u^(t)=urt(t/2) ,i.e.  , 

1  ;  -T  <t<T 

s - s 

0  ;  elsewhere  .  (13) 

Representation  in  the  form  (U)  is  rather  obvious  where  we  note  that  now 
k=l  is  required. 

B.  Channel.  No-cie.  ModzZ:  The  complex  envelope  n(t)  of  the  additive  noise  is 

assumed  modeled  as  a  linear  combination  of  a  generalized  shot  noise  process, 

typically  low  density  and  WGN.  In  particular, 

n(t)=y(t)+w(t)  ,  (lM 

where  w(t)=v  (t)-jw  (t)  is  a  zero-mean  complex  WGN  process  with  I/Q  components, 
c  s 

wc(t)  and  wg(t),  respectively,  mutually  independent  lowpass  WGN  processes  each 


;(t)  = 


.possessing  double-sided  noise  spectral  densi-y  K J2  vatts/Kz.  The  shot  noise 

u 

component  y(t)  accounts  for  the  relatively  infrequent  high-level  and  time- 
resolved  impulse  noise  hits  due  to  atmospheric  discharge  phenomenon,  or  vari¬ 
ous  contributions  due  to  man-made  noise  or  pulse  jamming.  The  Gaussian  noise 
component  w(t),  on  the  other  hand,  is  due  to  a  combination  of  front-end  noise 
and  the  larger  number  of  low-level  and  overlapping  impulse  hits.  The  latter 
contribution  is  expected  to  exhibit  Gaussian  behavior  from  central  limit 
theorem  considerations. 

The  shot  noise  component  can  be  modeled  as  the  output  of  a  linear  time- 
invariant  filter  excited  by  a  complex  amplitude  modulated  impulse  train  or 
point  process  at  its  input.  The  filter  generating  the  complex  shot  noise 
process  y(t)  is  assumed  to  possess  impulse  response  h(t),  or  equivalently 
system  transfer  function  H(s).  A  block  diagram  of  the  channel  noise  model 
is  illustrated  in  Fig.  1.  The  impulse  response  h(t)  will  in  general  be  com¬ 
plex  possessing  an  inphase  component  h  (t)  and  a  quadrature  component  b  (t). 

c  s 

The  complex  input  d(t)  to  the  linear  filter  generating  the  shot  noise  com¬ 
ponent  is  assumed  of  the  form 

H(t) 

a(t)  =  l  u .  6(t-t . )  .  (15) 

i=l 

Here  (d^)  is  an  appropriately  defined  complex  i.i.d.  weighting  sequence  and 
(N(t),  t^O)  is  a  point  process,  in  particular  a  counting  process  [6]-[8], 
whose  event  times  are  described  by  the  sequence  {t^}.  We  will  be  particularly 
interested  in  stationary  renewal  counting  processes  possessing  Gamma  distri¬ 
buted  interarrival  times  with  probability  density  function  (p.d.f.) 

v— 1 

f(x)  =  — -  exp{-x/e)  ;  x>0  ,  (l6) 

r(v)e 
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where  v>l,  and  the  parameter  6  is  defined  according  tc  6=1/ vX  with  X_>0  fixed. 
The  quantity  X  represents  the  average  number  of  events  per  unit  tine.  This 
quantity  will  generally  be  specified  in  units  normalized  to  the  baud  duration 
Tg  seconds.  Specification  of  the  two  parameters  v  and  XTg,  then,  completely 
defines  the  point  process  {N(t),  t>0). 

For  example,  if  v=l  then 

f(x)  =  Xe-Xx  ;  x_>0  .  (IT) 

This  is  the  exponential  distribution.  In  this  case,  (N(t),  t>0}  is  Poisson 
with  X  the  average  number  of  hits  per  unit. time.  Another  special  case  is 
obtained  by  letting  v-*®  while  holding  X  fixed.  The  result  is 

f(x)  =  6(x-l/X)  .  (18) 

In  this  case,  the  impulses  are  periodic  with  rate  X  per  unit  time.  By  intro¬ 
ducing  the  class  of  driving  point  processes  described  here,  we  have  a  conven¬ 
iently  parameterized  impulsive  noise  model  which  includes  these  two  extremes 
as  special  cases.  The  class  of  point  processes  (N(t),  t>0},  and  hence  u(t), 
is  easily  generated  on  a  digital  machine. 

In  what  follows  we  will  assume  that  the  complex  impulse  amplitudes  are 
-16  .  , 

of  the  form  u^R^e  i  with  {R..  )  an  i.i.d.  sequence  possessing  common  p.d.f. 
fr(’),  while  {e.}  is  likewise  an  i.i.d.  sequence  uniformly  distributed  over 
[-it, it].  Although  many  choices  for  f  (•)  are  possible,  we  will  restrict 
attention  to  two  which  are  specifically  motivated  by  related  ELF/VLF  noise 
modeling  work  [9]-[l2],  The  first  choice  is  the  power-Rayleigh  or  Wiebull 
distribution  with 

fp(R)  =  ~  Ha_1  exp{-(R/R0)a}  ;  R>0,  (19) 

R0 

where  the  characteristic  exponent  a  lies  in  the  range  0<a<2  and  Rq  is  a 
scale  parameter  given  by 


E*E2}  T 
R0  =  r(l+2/a) 


Another  useful  choice  is  the  lognormal  distribution  with 


f  (R)  =  — - —  exp  - 
r  »^oR 


UnR-y)2! 

2o<  j 


;  R>0  . 


Here  u  and  o2  are  the  mean  and  variance,  respectively,  of  a  Gaussian  variate 

g 

g  for  vhich  R=e  .  In  this  case,  for  fixed  value  of  y,  it  is  convenient  to 
describe  the  scale  parameter  in  terms  of  the  quantity 

V10  loSl0FT17  =  ^3o2  .  (22) 

These  tvo  distributions  are  sufficiently  general  to  exhibit  the  broad-tailed 
distributions  typical  of  many  observed  impulsive  noise  situations. 

Under  the  preceding  assumptions,  it  follows  that  the  I/Q  components  of 
the  shot  noise  process  can  be  generated  as  illustrated  in  Fig.  2  according  to 

N(t) 

y  (t)=  l  R. [h  (t-t  )cos6  -h  (t-t  )sin6  ] ,  (23a) 

C  ^  C  X  1  S  X  X 


N(t) 

Y  ft)=  l  R. [h  (t-t , )cos8  +h  (t-t  )sin6.] 
«  i“l  1  s  1  ic  l  l 


(23b) 


Here  h  (t)  and  h  (t)  are,  as  described  previously,  the  I/Q  components  of  the 

C  5 

complex  impulse  response  function  h(t).  We  will  assume  thau  h  (t)  and  h  (t) 

c  s 

are  both  equal  to  the  same  impulse  response  function  hg(t).  Several  choices 
for  hQ(t)  are  possible.  For  example,  the  simplest  choice  is  the  pulse  wave¬ 
form  of  duration  Tc  seconds  as  illustrated  in  Fig.  3.  The  corresponding  im¬ 


pulse  response  is 


hQ(t)  =  u_1(t)-u_1(t-Tc)  , 


where  u  ^(t)  is  the  unit  step  function 


4-10 


u_i(t)= 


t<0  . 


Generally  it  vill  be  the  case  that  XTc<<1  so  that  the  resulting  shot  noise 
process  is  truly  low-density.  Other  choices  of  characteristic  pulse  shape 
h  (t)  are  provided  in  Table  2  together  with  some  other  pertinent  parameters 
to  be  described  in  what  follows. 

Finally,  the  last  component  of  the  impulsive  noise  model  to  be  specified 
is  the  quantity 


which  represents  the  ratio  of  the  mean-square  value  of  the  shot  noise  compon¬ 
ent  to  the  double-sided  noise  spectral  density  of  the  AWGR  component.  The 
units  of  y  are  in  Hz,  or  equivalently  sec  .  In  what  follows  we  will  be 
more  interested  in  the  dimensionless  quantity  Y2ts»  representing  the  power 
in  the  shot  noise  normalized  to  that  of  AWGK  component  when  measured  in  a 

double-sided  noise-bandwidth  equal  to  the  baud  rate  f  ~1/1  . 

s  s 

For  a  fixed  choice  of  model  parameters  the  scale  parameters  associated 
with  the  amplitude  p.d.f.  fy(*  )  can  be  adjusted  to  achieve  a  specified  value 
of  y2Tg.  In  particular,  it  is  easily  shown,  following  the  approach  in  [13]- 
[15],  that  the  stationary  input  point  process  il(t)  described  by  ( 15 )  and  the 
sequel  possesses  power  spectral  density  S^(w)=XE{R2}  independent  of  the 
value  of  v.  It  follows  that  the  power  spectral  density  of  the  low-density  shot 
noise  process  is  given  by 


S_^(u>)=2XlH0(  ju>)  S  2E{B2>  , 


from  which  we  obtain 


E{|y(t) 1 2}=UXB Ieq(o)  1 2E{R2 }  . 


U-ll 


Here 


i 

LV-‘ 

L*[  .* 

L" 


m 

KV 

L/. 

* 


f  CD 

iwL  iy»i^ 

B  =  — 


2|Hq(0) j  2 


(29) 


is  the  equivalent  single-sided  rectangular  bandwidth  in  Hz  and  is  provided 
in  Table  2  for  each  of  the  characteristic  pulse  shapes  considered.  Also 
provided  in  Table  2  is  the  mean-square  value  E{  | )  |  2 }  of  the  shot-noise 
component  where  all  parameters  are  conveniently  expressed  in  dimension¬ 
less  form  allowing  explicit  evaluation  of  E{R2}  ,  and  hence  the  scale 

parameter  associated  with  f  ( • ) ,  for  a  fixed  value  of  y2T  . 

r  s 

III.  Statistical  Characterization  of  Noise  Model: 

In  this  section  we  describe  some  of  the  low-order  statistics  which  pro¬ 
vide  a  partial,  although  useful,  characterization  of  the  impulsive  noise 
model  described  in  the  preceding  section.  Since  most  of  the  literature  on 
noise  measurements,  particularly  on  the  measurement  of  atmospheric  noise, 
have  dealt  almost  exclusively  with  the  envelope  process  A(t)=|n(t)|,  we  con¬ 
centrate  on  envelope  statistics.  While  many  choices  are  possible  we  con¬ 
sider  two  envelope  statistics:  the  amplitude  probability  distribution  (APD), 
and  the  average  level  crossing  rate  (ALCR).  These  two  statistics  are  not 
only  illustrative  of  impulsive  noise  effects,  but  data  are  generally  avail¬ 
able  for  gross  calibration  purposes.  We  consider  each  of  these  statistics 
separately. 

A.  A mpLLtudt  PKobabiLLty  VZi&ubution  (A PV) :  If+  f  (•)  represents  the  first- 
order  p.d.f.  of  the  envelope  process  A(t),  then  the  APD  is  merely  the  prob¬ 
ability  » 

Qa(*)  =  j  **aU)d*  .  ( 30) 

_ x _ 

t  We  will  assume  steady-state  conditions  and  suppress  any  explicit  function¬ 
al  dependence  upon  time  in  describing  first-order  distributions. 
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of  exceeding  the  fixed  level  x.  In  evaluating  APD's,  as  veil  as  ALCR's, 
it  is  convenient  to  normalize  the  fixed  level  x  to  the  rms  value  of  the 
envelope.  Because  of  the  presence  of  the  AWGN  component,  however,  this 
rms  value  is  unbounded.  Thus.it  is  necessary  in  evaluating  APD's  and  ALCR's 
to  assume  that  the  noise  process  n(t)  is  observed  through  a  fictitious  measure¬ 
ment  filter  of  finite  single-sided  equivalent  rectangular  bandwidth  B'  Hz. 

The  question  of  choosing  an  appropriate  bandwidth  for  evaluating  APD's 
poses  some  difficulties.  In  particular,  if  the  measurement  bandwidth  B' 
is  not  appreciably  larger  than  that  of  the. shot  noise  component  then  the 
actual  shape  of  the  APD  curve  can  be  expected  to  depend  upon  B'.  If,  on  the 
other  hand,  B'  is  much  larger  than  the  bandwidth  of  the  shot  noise  compon¬ 
ent  then  the  shot  noise  is  unaffected  by  the  presence  of  this  fictitious 
measurement  filter.  In  this  case,  changing  B’  merely  allows  more  or  less 
AWGN  thereby  scaling  the  APD  curves  up  or  down  without  affecting  the  shape. 
Similar  comments  apply  in  the  case  of  ALCR's.  We  assume  then  that  B’>>B, 
with  B  the  equivalent  rectangular  bandwidth  of  the  shot  noise  component 
given  by  (29).  The  total  mean-square  value  of  the  "filtered"  impulsive 
noise  3'(t)  is  then  given  by 


a2,=  a 2  +  o?t  .  (31) 

fi'  y  v' 

where  a2  =  E{|?(t)|2}  is  unaffected  by  the  action  of  the  filter  and 

o2,  =  2NqB'.  We  now  proceed  to  an  evaluation  of  the  APD  for  this  "filtered" 

impulsive  noise  process. 

From  (30), the  difficulty  in  evaluating  the  APD  is  directly  related  to 


the  ability  of  determining  the  p.d.f.  f  (•).  While  the  exact  evaluation 

GL 

of  f  (•)  is  extremely  difficult  in  general,  evaluation  of  the  characteristic 

a 
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function  (ch.f.)  is  relatively  straight forward  in  some  special  cases. 

This  allows  evaluation  of  f  (•),  and  hence  Q  (•),  by  numerical  techniques. 

The  joint  ch.f.  of  the  I/Q  components"'"  n’(t)  and  n'(t),  respectively, 

c  s 

is  given  by 

V  n'  ^V1,V2^  +  ^n^t))]} 

c’  s 

=  *y  ,y  <vi>Ts>*vJ,.'  <VV  •  (32) 

c  s  12 

where  we  have  made  use  of  the  independence  of  the  shot  noise  and  WGN  com¬ 
ponents.  Clearly, 

*v’,»’<T1'V2)  "  elp{-  “I"  [vl+V2]) 

=  exp{-a2B2/2)  ,  (33) 

V 

where  for  convenience  we  have  defined  o2  =  KB'  and  B=/  v2+v2  .  The 

w  u  1  d 

quantity  o^  =  o?,/2  represents  the  common  variance  of  the  I/Q  components 
of  the  filtered  background  Gaussian  component  tt'(t).  Similarly,  the 
joint  ch.f. 

>y  (v1,v2)=E{exp[j(v^rc(t) +  v2yg(t))]}  ,  (3*0 

c  s 

of  the  shot  noise  component  can  be  shown  to  depend  upon  v1*v2  only  through 

B.  It  follows  that  the  joint  ch.f.  ^  ,  , (v  ,v2)  then  depends  only  upon 

c’  s 

B.  In  this  case  the  envelope  p.d.f.  is  given  by 


/» 

f  (A)  =  ABJ  (&AH  ,  ,  ( B)dB 

a  0  nc’ns 

0  C  3 

00  2  2  / 

=  A8J0(BAH  (B)e"awS  /2dB  , 

*  r  s 


-  We  will  distinguish  the  output  components  of  the  fictitious  measurement 
filter  with  a  prime. 
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Vhere  Jv( * '  is  the  or--nary  Bessel  function  of  the  first  kind  of  order  v, 
and  ve  have  made  use  of  (32)  and  (33)  in  obtaining  this  last  expression. 
Finally,  it  is  easily  established  that  the  APD  given  by  (30)  can  be  eval¬ 
uated  according  to 

Jl(Bx)  ^y  ,y  (6)e‘aJ62/2dB  ,  (36) 

c  s 


,00 


Qo(x)  =  1— x 
a 


Hi 


which  is  readily  evaluated  by  quadrature  given  explicit  expression  for 


V,y  {6)‘ 

c  s 


The  major  difficulty  in  evaluating  either  f&(.)  or  Q  (•)  through  (35) 
and  (36),  respectively,  is  the  general  inability  of  obtaining  concise  ex¬ 
pressions  for  the  joint  ch.f.  *  (S)  of  the  shot  noise  component.  An 

exception  is  the  case  where  (K(t),  t>0)  is  Poisson,  i.e.,  v=l.  In  this 
case  it  is  shown  in  Appendix  A  that 

*yc,ys(B)  ■  eltp{X  |  IVV*5«‘0(’>»  -llaT}  '  (37) 

Here,  the  expectation  appearing  in  the  integral  is  with  respect  to  the 
random  amplitude  F.  of  the  input  pulse  process,  i.e., 

,00 


Er(J0(v^  BRh0(T))}  = 


JQ(>/2SRh0(T))fr(R)dR  , 


(38) 


where  fp(-)  is  given  by  either  (19)  or  (21).  Again  this  quantity  can  be 
evaluated  in  closed  form  only  in  special  cases.  One  useful  case  is  when 
fr(*)  is  Power-Rayleigh  with  characteristic  exponent  a=l.o,  in  which  case 

it  is  easily  shown  by  straightforward  integration  that 

Er{J0(^6Rh0(x))}  =  [1  +  2R^B2h2(T)rJs  .  (39) 

Substitution  into  (37)  for  hQ(t)=e~atu  ^(t)  yields 
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Vys 


(6)  = 


l+/l+(a/X)c2B2 


Here,c^  =  E{|5^(t)|2}  is  the  variance  of  the  shot  component  and  ve  have  made 
use  of  (20)  together  with  Table  2  to  evaluate  the  scale  parameter  R^  assoc¬ 
iated  with  the  pover-Rayleigh  distribution.  It  is  interesting  to  note  that 
in  the  high-density  case,  corresponding  to  large  (X/a),  we  have 

1’v  v  (B)  :  exp{-  |  a2B2)  ,  (fcl) 

yc’  s  y 

where  a2=a?/2  is  the  common  variance  of  the  I/Q  components  of  the  shot  noise 

•7 

process.  As  expected,  the  shot  noise  is  Gaussian  in  this  limiting  case. 

Substitution  of  (1+0)  into  (36)  allows  evaluation  of  the  APD  by  quad¬ 
rature  in  this  special  but  useful  case.  The  results  are  illustrated  in 
Fig.  1*  using  a  61+-point  quadrature  formula.  Here  we  plot  the  envelope  norm¬ 
alized  to  its  rms  value  versus  the  percent  of  time  that  the  ordinate  is  ex¬ 
ceeded.  The  curves  are  specifically  designed  so  that  a  Gaussian  distribu¬ 
tion  (Rayleigh  envelope)  plots  as  a  straight  line  of  slope  -1/2.  Observe 
that  the  APD's  do  appear  Gaussian  at  low  levels  and  depart  considerably 
from  Gaussian  behavior  at  high  levels.  Also  note  that  at  low  levels  the 

curves,  although  parallel  to,  are  depressed  below  the  Gaussian  curve  (y2T  =0) 

s 

The  reason  for  this  is  that  we  have  chosen  to  normalize  the  envelope  to  the 
total  rms  level.  From  (31)  we  observe  that 


E{A2(t)}  =  E{ ]ft(t) | 2} 

-*  H 


so  that  in  the  low-level  Gaussian  region  the  curves  can  be  expected  to  be 
depressed  below  the  straight-line  Gaussian  characteristic  by  approximately 


(1*3) 


V  10  lo6i0 


r1+i!vi 

L  2B'T  J  ’ 


which  is  consistent  with  Fig.  1*. 

Also  included  in  Fig.  1»  are  simulation  results  obtained  on  the  previous¬ 
ly  described  ICS  [5).  These  results  have  been  obtained  using  a  minimum  of 
10 7  samples  in  each  case.  As  can  be  seen  the  agreement  with  the  computed 
results  are  excellent.  In  more  general  cases,  the  numerical  evaluation  of 
APD's  according  to  (36)  has  proven  extremely  difficult.  The  major  difficulty 

is  in  obtaining  closed  form  expressions  for  the  Joint  ch.f.  (B).  In- 

yc  ‘ys 

deed,  even  in  the  Poisson  case  evaluation  of  i|i  (B)  according  to  (37)  is 

yc  *ys 

difficult  and  is  quite  hopeless  for  non-Poisson  input  processes,  i.e.,  v>l. 
Because  of  this  general  intractability  of  analytical  evaluation  of  AFD's, 
and  the  relative  ease  with  which  accurate  simulation  results  can  be  obtain¬ 
ed  we  will  rely  exclusively  upon  the  latter  in  what  follows. 

In  Fig.  5  we  illustrate  simulation  results  for  the  same  situation  as 
in  Fig.  1*,  except  now  a=0. 5.  Note  the  much  higher  probability  of  exceeding 
large  levels  indicating  much  broader  tails  to  the  envelope  p.d.f.  Also 

note  that  even  small  values  of  f2T  (3dB  say)  can  have  a  pronounced  effect 

s 

on  the  high-amplitude  region  of  the  APD.  In  Fig.  6  we  illustrate  the  effect 

of  varying  B'T  with  y2T  *20dB  and  conditions  otherwise  the  same  as  in  Fig.  5. 
s  s 

The  APD's  approach  the  Gaussian  curve  in  the  low-amplitude  region  with  in¬ 
creasing  B'T  as  predicted  by  (1*3).  Clearly,  increasing  B'T  results  in 
s  s 

more  of  the  AWGN  component  being  passed  by  the  fictitious  measurement  filter 
thereby  contributing  more  heavily  to  the  total  mean-square  noise.  Observe 
that  provided  B'>B,  as  is  indeed  the  case  in  Fig.  6,  the  APD  is  affected 
very  little  in  the  high-amplitude  region  as  B'Tg  is  increased. 


The  effect  of  varying  the  parameter  aT  is  illustrated  in  Fig.  7, 

s 

again  for  pover-Rayleigh  pulse  amplitudes  with  a=0.5,  XT  =0.1  and  y2T  = 

s  s 

20dB.  For  small  aTg  there  is  considerable  pulse  overlap  since  then  X/a>l 

and  the  APD  resembles  that  of  Gaussian  noise  as  might  be  expected  from 

central  limit  theorem  considerations.  For  larger  aT  values  the 'individ- 

s 

ual  pulses  are  resolved  and  the  APD  departs  drastically  from  Gaussian  be¬ 
havior  in  the  large-amplitude  region.  Similar  comments  apply  to  Fig.  8, 

where  now  aT  is  fixed  and  the  normalized  pulse  arrival  rate  XT  is  varied, 
s  s 

Again,  for  large  XT  ,  or  equivalently  X/a>>l,  the  APD  resembles  that  of 

s 

Gaussian  noise  due  to  the  relatively  large  number  of  pulse  overlaps. 

In  Fig.'s  9  and  10  we  illustrate  the  effect  of  varying  a  and  V.  for 

d 

pover-Rayleigh  and  lognormal  pulse  amplitudes, respectively.  It  is  clear 
how  by  choice  of  either  a  or  the  behavior  in  the  large- amplitude  region 
can  be  controlled  with  little  effect  on  the  small- amplitude  region.  Indeed, 
by  appropriate  choice  of  parameter  values  the  corresponding  APD's  for  both 
pover-Rayleigh  and  lognormal  distributed  pulse  amplitudes  can  be  made  to 
agree  quite  well,  at  least  for  a  range  of  parameter  values.  This  is  illus¬ 
trated  in  Fig.  11  where  the  values  of  a  and  have  been  chosen  to  provide 
the  best  correspondence  for  other  parameters  held  fixed. 

Finally,  in  Fig.'s  12  and  13  we  illustrate  the  effect  of  changing  the 
characteristic  pulse  shape  and  pulse  interarrival  statistics,  respectively. 
In  Fig.  12  we  illustrate  that  the  pulse  shape  has  little  effect  on  APD's 
since  there  is  no  noticeable  difference  for  a  variety  of  characteristic 
pulse  waveforms  selected  from  Table  2.  Similarly,  from  Fig.  13  we  see  that 
varying  the  pulse  interarrival  statistics.vhile  maintaining  XTg  constant, 
has  little  effect  on  the  APD. 
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S.  Aug/tagg.  LtvcZ  CA.0644yig  Rfl-tg  (A LCR) :  Consider  the  average  number  N+(u) 

of  uperossings  per  second  of  a  fixed  level  u  by  the  envelope  process  A(t)= 

|2'(t)|  appearing  at  the  output  of  the  fictitious  measurement  filter.  The 

ALCR  is  then  the  quantity  N  (ulT^,  representing  the  average  number  of  up- 

crossings  of  the  level  u  in  a  baud  or  symbol  period  of  duration  T  sec. 

In  general,  explicit  evaluation  of  the  ALCR  for  the  impulsive  noise 

model  described  in  the  preceeding  section  is  extremely  difficult,  if  at 

all  possible.  However,  in  the  particular  case  where  n'(t)  is  a  complex 

zero-mean  Gaussian  process  (i.e.,  y2T  =0)  with  rms  bandwidth  B„  Kz,  and 

s  0 

variance  E{ |fi* (t ) | 2}=E{A2(t ) }  =  AQ,  we  have  [l6],  [17], 

H+(u)Ts=^7  B0Ts(u/AQ)exp{-|{u/A0)2}  ',  (UU) 

where 

2"B0 

and  SnQ( • )  is  the  common  power  spectral  density  of  the  assumed  independent 

I/Q  components.  The  ALCR  is  then  readily  evaluated  as  a  function  of  the 

normalized  level  u/A-  for  selected  choices  of  B_T  .  For  example,  if  the 

o  US" 

measxurement  filter  is  flat  over  the  interval  [-B',B']  we  have  BQ=B ' //3 
allowing  explicit  evaluation  of  the  ALCR  according  to  (M)  as  a  function 
of  3'Ts  where  now  A^=  /2HqB'  .  In  Fig.  1^  we  illustrate  computed  results 
for  selected  values  of  B'T  together  with  simulation  results  collected  on 
the  ICS.  In  all  cases  the  simulation  results  are  obtained  on  the  basis  of 

7 

at  least  10  samples.  As  can  be  seen  the  agreement  between  computed  and 
simulated  results  is  excellent.  Because  of  this  close  correspondence,  and 
+  Here  B'  is  the  equivalent  single-sided  rectangular  bandwidth. 
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the  general  intractability  of  the  ALCR  in  r.on-Gauscian  situations,  we  will 
make  exclusive  use  of  simulation  results  in  what  follows.  Finally,  before 
leaving  this  special  case,  we  should  note  that  even  if  y(t)  were  Gaussian 


the  ALCR  would  depend  not  only  upon  Y2Tg  but  in  a  very  complicated  way  upon 

B'T  and  BIT  ,  where  B'  is  now  the  rms  bandwidth  of  the  shot  noise  compon- 
s  0  s  0 

ent.  In  particular,  it  is  easily  seen  that  the  ALCR  is  still  given  by  (44) 
where  now 


B  =  — 
0  /3 


2rp  ~| 


3y"T 

1+(bo/b’)2^tT 


y2t 


1+ 


2B'T 


In  Fig.  15  we  illustrate  the  ALCR  for  impulsive  noise  under  selected 
parameter  choice  for  various  values  of  Y2Ts>  The  parameters  correspond  to 
the  APD's  illustrated  in  Fig.  5.  As  can  be  observed  the  presence  of  the  shot 
noise  component  results  in  considerably  increased  crossing  rates  relative  to 
the  case  of  Gaussian  noise  alone  (y2Ts=0)  in  the  large-amplitude  region.  In 
the  low-amplitude  region  the  curves  are  parallel  to  the  Gaussian  character¬ 
istic  as  to  be  expected,  except  they  are  displaced  to  the  left.  Again,  this 
is  due  to  the  fact  that  we  have  chosen  to  normalize  the  abscissa  to  the  total 
rms  value  in  which  case  the  curves  can  be  expected  to  be  displaced  by  an 
amount  given  by  (43).  This  is  illustrated  also  in  Fig.  16  where  we  vary 

B'T  with  y2T  =20dB  and  conditions  otherwise  the  came  as  in  Fig.  15* 
s  s 

The  effect  of  varying  the  parameter  aTg  is  illustrated  in  Fig.  IT,  corr¬ 
esponding  to  the  APD  plots  in  Fig.  7-  Again,  for  small  aTg  there  is  consider¬ 
able  pulse  overlap  and  n'(t)  is  approximately  Gaussian.  The  ALCR  then  converg 
es  to  the  dotted  curve  which  is  computed  according  to  (44)  where  BQ  is  now  e- 
valuated  from  (46).  Similar  comments  apply  to  Fig.l8  where  aT  =2.0  is  fixed 


and  the  normalized  pulse  arrival  rate  XTg  is  varied.  Again,  as  for  the  APD's 
in  Fig.  8,  for  large  XT  ,  or  equivalently  X/a»l,  the  ALCR  resembles  that  for 
Gaussian  noise  due  to  the  relatively  large  number  of  pulse  overlaps. 

In  Fig.'s  19  and  20  we  illustrate  the  effect  of  varying  a  and  V,  corres- 

d 

ponding  to  Fig.'s  9  and  10,  respectively.  Again,  it  is  clear  how  the  choice 
of  either  a  or  Vd  effects  the  behavior  in  the  large-amplitude  region  with 
little  noticeable  effect  on  the  small-amplitude  nearly-Gaussian  region.  In 
Fig.  21  we  illustrate  how  the  values  of  a  and  can  be  chosen  to  provide  a 
close  match  in  ALCR  corresponding  to  the  APD  results  illustrated  in  Fig.  11. 

Finally,  in  Fig.'s  22  and  23  we  illustrate  the  effect  of  changing  the 
characteristic  pulse  shape  and  interarrival  statistics,  respectively.  These 
results  correspond  to  APD's  illustrated  in  Fig.'s  12  and  13,  respectively. 

As  in  the  case  of  API's,  we  observe  that  neither  the  pulse  shape  or  the  in¬ 
terarrival  statistics  has  much  effect  on  the  ALCR,  provided,  of  course,  XT 

s 

is  maintained  constant. 

IV.  Performance  of  Linear  Matched  Filter  Receivers: 

In  this  section  we  consider  the  performance  of  linear  matched  filter  re¬ 
ceivers  in  noise  environments  characterized  by  the  previously  developed  im¬ 
pulsive  noise  model.  Specifically,  we  evaluate  the  symbol  error  probability 
for  each  of  the  modulation  strategies  described  in  Section  II  and  summarized 
in  Table  2.  It  proves  convenient  to  consider  separately  the  cases  of  co¬ 
herent  (i.e.,  BPSK,  QPSK  and  MSK)  and  noncoherent  (i.e.,  DPSK,  BPSK)  modula¬ 
tion  strategies.  Indeed,  the  decision  variable  is  of  identical  form  for 
each  of  these  two  distinct  cases.  In  general,  the  decision  variable  can 
be  written  as  a  sum  of  three  separate  terms  associated  with  the  signal. 


shot  noise,  and  AWC-N  components,  respectively.  The  error  probability 


conditioned  upon  the  shot  noise  is  easily  determined  as  this  is  merely  the 
corresponding  error  probability  for  the  AWGN  channel.  It  follows  that  the 
overall  error  probability  can  then  be  determined  by  averaging  with  respect 
to  the  shot  noise  component.  This  is  the  approach  to  be  followed  in  sub¬ 
sequent  developments  where  we  neglect  the  effects  of  symbol  synchroniza¬ 
tion  and  phase  tracking  errors. 

A.  Coh.eAe.nt  MocLiZ&tton:  In  this  case  it  is  easily  seen  that  individual 
bit  decisions  are  based  upon  threshold  tests  of  the  sufficient  statistics 
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where  u  (t)  and  u  (t)  are  appropriate  baseband  waveforms  modulating  the 


I/Q  rails  as  described  previously  in  Section  II  where  we  recall  k=l  for  MSK, 


and  k=0  otherwise.  For  example,  in  BPSK  only  the  sequence  (r  is  employ¬ 


ed  to  determine  the  binary  sequence  {x^}  with 


*i  =  Sgnrci  • 


(U8) 


For  either  QPSK  or  MSK,  the  two  sequences  anc*  ^*2i+l^  are  °^‘ta^ne^  *>y 

similar  operations  upon  {r^}  and  {rg^}  *  respectively.  In  general,  by  appro 
priate  normalization,  the  decision  variable  during  any  given  signaling  inter 
val  can  be  expressed  in  the  form 


d  = 


(U9) 


4- 


Here  {£.}  represents  the  hard-dec isioned  output  sequence  corresponding 
to  the  transmitted  sequence  {x.}  . 


Here,  is  the  energy  per  hit,  x=±l  is  a  binary  variable  corresponding  to 
the  transmitted  bit,  v  is  a  zero-mean  unit  variance  Gaussian  variate,  and 


finally 


where 


f2  x 
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0 
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( 51a) 
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and  finally 


A_l_  fS 


®  h0(T-t)Uc(T-2K-1Ts)dT  . 


( 51c) 


In  developing  these  expressions  we  have  made  explicit  use  of  (23)  and  the 
sequel.  The  {t^  in  (50)  are,  of  course,  the  event  times  of  the  underlying 
point  process  (K(t),  t>0). 

In  any  case,  using  ( 1* 9)  the  probability  of  error  under  equiprobable  signal¬ 
ing  can  be  expressed  in  the  form 


■r#‘) 


t  U)«  . 

V 


where  f  (•)  is  the  p.d.f.  of  y  given  by  (50)  and 

y 

Q(x)  =  [  exp{-y2/2)  dy  .  (53) 

Unfortunately,  explicit  evaluation  of  f  (•)  is  extremely  difficult  so  that 

_ v 

+  It  is  easily  established  that  e-1  for  BPSK  while  e  -  h  for  both  QPSK  and 
MSK. 


numerical  evaluation  of  Pft  though  (52)  has  proven  impractical.  In  some  cases, 
however,  the  ch.f.  ij> (*)  is  more  amenable  to  evaluation.  Indeed,  an  alter- 
native  and  more  useful  expression  for  error  probability  in  this  case  is  shown 


in  Appendix  B  to  be  given  by 


P 

e 


-v2/2 

e  dv  , 


(5M 


which  is  readily  evaluated  by  quadrature  given  an  explicit  expression  for 

V>- 

A  useful  special  case  for  which  the  evaluation  of  is  relatively 

y 

straightforward  is  when  {N(t),t>0}  is  Poisson.  In  this  case,  y  in  (50)  is  then 
a  filtered  Poisson  process  with  random  amplitude  sequence  {Z^}  and  character¬ 
istic  pulse  shape  g(t).  It  follows  [6],  [7]  that 

r2kT 

\|<  (v)=exp{A  S[>J'  (vg(T))-l]dT)  ,  (55) 

J  J  z 


where  (•)  is  the  common  ch.f.  of  the  random  amplitude  sequence  {Z.  }.  In 

Z  X 

particular,  following  a  development  similar  to  that  in  Appendix  A  we  easily 


establish  that 

^2(vg(x)) 


(56) 


where,  as  in  (38),  E^*}  indicates  expectation  with  respect  to  the  random 
amplitude  of  the  input  pulse  process  generating  the  shot  noise.  Again, 
this  quantity,  and  hence  (v),  can  be  evaluated  only  in  special  cases. 

y 

One  useful  case  where  ( • )  can  be  explicitely  evaluated  is  when  R  is 

z 

power-Rayleigh  with  characteristic  exponent  a=1.0,  where  from  (39)  we  have 

V'z(vg(-r))  *  (l+U(2kTs/eN0)R2v2g(T)] ,  (57) 

with  Fq  again  the  scale  parameter  associated  with  the  power-Rayleigh  distri- 


bution  given  by  (20).  In  Appendix  C  it  is  shown  that  for  the  particular  case+ 
of  coherent  BPSK  and  vith  characteristic  pulse  shape  hQ(t)=e~atu  we  have 


vith 

and 


^y(v)  =^(v)^(v)  , 

♦y(v)  ^  ^l+A-t-CKV  j  ?S/  S 


/(v)  £  — i±A±EI _ 

y  l+?Kv2+»/(l+;v^)(l+cK/vi£) 


-XT  /aT  /l+Cv2 
s  s 


(58) 

(59a) 

(59b) 


The  two  quantities  ^  (v)  and  ^+(v)  can  be  interpreted  as  the  contributions 

y  y 

from  impulses  occurring  prior  to  the  signaling  interval  and  during  the  sig¬ 


naling  interval,  respectively.  At  any  rate,  substitution  of  this  express¬ 
ion  for  ^  (v)  into  (5M  allows  numerical  evaluation  of  the  restating  error 

y 

probability. 

In  Fig.  we  illustrate  typical  computed  results  using  a  b^-point 

auadrature  formula  for  various  values  of  XT  vith  aT  =U.O  and  y2T  =20cLB. 

■  s  s  s 

The  curves  here  and  in  the  sequel  have  been  plotted  versus  the  quantity 
E^/Kq  which  is  the  energy  per  bit  normalized  to  the  spectral  density  of  the 
AWGN  component.  We  have  also  indicated  in  Fig.  2b  the  error  probability 
performance  in  the  absence  of  impulsive  noise  as  well  as  the  approximate 

XX 

performance  if  all  the  noise  were  Gaussian  and  under  the  assumption' '  3Tg>l. 


+  Actually,  the  indentical  result  holds  for  QPSK  while  the  results  for  KSX 
are  considerably  more  complicated. 

it  Here  B  is  the  equivalent  rectangular  bandwidth  of  the  shot  noise  component. 


Specifically ,  under  the  latter  assumption  the  total  noise  environment  is 
Gaussian  with  a  power  spectral  density  approximately  flat  over  the  matched 


filter  bandwidth  (l/T  Hz)  at  the  value 


S  (y)  =  f-  +  S  (0) 
nr.  2  yy 

N  , 

=  [  l+^s-l 


where  Syy^  is  the  cornnon  P°wer  spectral  density  of  the  I/Q  components  of 
the  shot  noise  and  we  have  made  use  of  the  fact 

2 

Syy<0>  *  S - If  <V2)  '  <6l> 

It  follows  that  the  error  probability  in  this  environment  is  simply  the 
corresponding  result  for  the  background  AWGN  alone  shifted  to  the  right  by 

d2  =  10  log10  1*  gjl  dB  .  (62) 

U  s  -I 

According  to  Table  2,  for  the  case  treated  here  BT  -1  so  the  condition  BT  >1 

s  s 

is  not  quite  satisfied.  Nevertheless,  we  see  that  for  large  ATg,  or  equiv¬ 
alently  large  A/a,  the  error  probability  does  indeed  converge  to  the  approx¬ 
imate  asymptotic  performance. 

Finally,  observe  that  simulation  results  have  also  been  included  in 
Fig.  2l*  indicating  close  correspondence  with  the  computed  results.  The 
simulation  results  were  again  obtained  on  the  ICS.  Because  of  this  close 
correspondence  between  simulated  and  computed  results  and  the  general 
intractability  of  analytical  evaluation  in  more  general  cases, we  will  make 
exclusive  use  of  simulation  results  in  what  follows. 

In  Fig.  25  we  illustrate  the  effect  of  y2T  on  error  probability  per¬ 
formance  with  BPSK  modulation  and  typical  parameter  choices  corresponding 


to  the  APD  results  in  Fig.  5-  Here  it  is  clear  that  even  a  small  amount  of 


impulse  noise  can  have  a  disastrous  effect  on  the  error  probability  perform¬ 
ance.  The  corresponding  effect  of  the  parameter  aT  on  P  with  other 

s  e 

parameters  held  fixed  is  illustrated  in  Fig.  26.  For  small  aTg  ,  or  equivalently 
large  1/a,  the  pronounced  broad  tails  are  absent  and  Pg  does  decrease  rapidly 
with  increasing  as  the  total  noise  environment  is  then  approximately  Gauss¬ 

ian.  The  results  here  correspond  to  APD's  illustrated  previously  in  Fig.  7. 

In  Fig.'s  27  and  28,  we  illustrate  the  effect  on  P  of  a  and  V,  for  the 

e  d 

power-Rayleigh  and  lognormal  distributions,  respectively.  For  the  parameter 

choices  considered,  BTs>l  while  X/a>l  so  the  performance  is  expected  to  be  given 

by  our  approximate  result  valid  under  the  assumption  the  total  noise  environment 

is  Gaussian.  As  can  be  seen  this  is  true  only  for  relatively  large  a  or  small 

V^.  In  Fig.  29  we  illustrate  that  if  a  and  are  chosen,  as  in  Fig.  11  to 

provide  a  close  match  in  APD,  then  the  resulting  error  probability  performance 

is  likewise  quite  close.  This  is  further  illustrated  in  Fig.  30  where,  for  a 

unipolar  pulse,  varying  the  characteristic  pulse  shape  has  little  effect  on  P  , 

e 

as  observed  previously  for  APD's  in  Fig.  12.  For  a  bipolar  pulse,  on  the  other 

hand,  P  improves  with  increasing  toT  since  then  the  matched  filter  is  able  to 
6  S 

average  the  effects  of  the  impulse  noise  component.  Similarly,  in  Fig.  31  we 
demonstrate  that  varying  v,  and  hence  the  pulse  interarrival  distribution,  has 
little  effect  on  Pg  consistent  with  APD  behavior  illustrated  in  Fig.  13. 

Finally,  in  Fig.  32  we  illustrate  some  typical  results  employing  other 
coherent  modulation  strategies.  As  expected,  the  QPSK  results  are  nearly 
identical  to  those  for  BPSK  with  only  minor  differences  from  the  behavior  of  MSK. 

8.  NoncoheAtnt  ModuZaCion:  We  will  be  specifically  concerned  with  the  error 
probability  performance  of  noncoherent  BFSK  and  DPSK.  By  exploiting  well- 
known  analogies  [l8]  between  these  two  modulation  strategies  it  is  possible 
to  develop  a  unified  approach.  Consider  first  the  case  of  BFSK  where 


h-27 


individual  bit  decisions  are  generally  made  on  the  complex  matched  filter  outputs 


ri,k 


,(i+3s)T 


i-HjT 


r (t)u.*(t-i?  )dt;  i=C,l,2,.. 

*  5  i 

*  v  >-*•  > 


(63) 


vhere  {^(t),  k=0,l  are  appropriately  defined  complex  vaveforms  associated 
vith  each  of  the  binary  symbols  to  be  transmitted.  Specifically, 

\(t)  =  uc(t)+jxkug(t)  ;  k=0,l  (6U ) 

vith  x=(2k-l),  k=0,l  and  u  (t),  u  (t)  the  appropriate  baseband  I/Q  vaveforms 

A  C  S 

from  Table  1. 

Final  bit  decisions  for  BFSK  are  then  based  upon  a  threshold  test  on  the 
statistics 

zi  =  |fi,J2’  l*i,0l2  ;  is0‘1 .  (65) 

such  that  the  declared  binary  sequence  (5L)  is 


=  sgn  zi  .  (66) 

After  appropriate  normalization,  the  statistics  given  by  (63)  evaluated  dur¬ 
ing  any  one  signaling  interval  are  of  the  generic  form 


and 


eJ6+V*o  * 

eJ  6+y1+v1  » 


(67a) 


(6Tb) 


for  k=0,l,  respectively.  Here,  x=±l  represents  the  transmitted  bit  vhile  6 
is  an  unknovn  phase  offset  assumed  uniformly  distributed  over  [ — tt , t ] .  The 
quantities  k=0,l,  are  mutually  independent  zero-mean  complex  Gaussian 
variates  vhose  real  and  imaginary  components  are  each  of  unit  variance. 
Finally,  y^,  k=0,l  represent  contributions  due  to  the  shot  noise  component 
defined  according  to  y,  =y.  -Jy,  _  vith 


(68a) 


yk,c  =  Be 


/K0Ts  •  yizKU--s/2)dt\ 


N(Tg) 


-  I  z!k’c)s(tj  ;  >-=0,1 


while 


while 


and  finally 


while 


T 

yk,s  ~  Im  J KqT  y(t)u.£(»,-Tg/2)dT;  ■ 

•V  s  o  < 

(68b) 

N{Ts) 

=  I  z!k,s)g(t.)  ;  k=0,l 

i=l  1  1 

>/g^(t)+g^(t)  with 

fT 

6c(t)  =  T~  \(T-t)uc(T-Ts/2)di  , 

S  0 

(69a) 

Ts 

gs(t)  "  T~  [  h0(T-t)u  (x-Te/2)dT, 

5  0  “ 

(69b) 

z(°,c)=  2(Tg/M0)%Ricos(6i+ifA-*i)  , 

(70a) 

z|°’s)=  2(Ts/N0j’5Risin(ei+«A-^i)  , 

(70b) 

Z,(l,c)=  2(T  /Nn)Vcos(e ,+xA+d.  )  , 

1  SUl  i  i 

(70c) 

z(!,s)=  2(Ts/K0)5sR.sin(ei+-A+di)  , 

(70d) 

with  =  tan1gs(ti)/gc(ti). 

We  now  show  the  relationship  of  noncoherent  D?SK  to  BFSK  following  the 
developaent  in  [l8].  Specifically,  for  D?SK  individual  bit  decisions  are 
Bade  on  the  basis  of  the  single  sequence  of  complex  matched  filter  outputs 


(i+*s)T. 


?(t)u  (t-iT  )dt  ;  i=0,l,2, . . .  » 

c  s 


(i-%)T 


where  u  (t)=u_(t)  is  the  corresponding  "baseband  waveform  from  Table  1.  Final 
c  0 

decisions  are  then  made  on  the  statistics 

z.  =  Re{f.f*  ,}  ;  i=0,l,2, . . .  (72) 

i  i  i-1 

with  the  declared  sequence  computed  according  to  (66).  As  shown  in  [l8] , 

this  last  statistic  can  be  expressed  in  the  form 


which  is  functionally  identical  to  ( 6U )  for  BFSK  where  now 


Wi 


•  r 1 

*  ri,o 


f  -f 
i  ri-l 


The  sequences  {? '  .  }  ,  k-0,1  can  then  be  computed  according  to 
1 

r(i+*s)T 

k  =  B  f(t)u£(t-iT.)dt  ; 

'  U-3/2)t  °  k=0tl> 


where 


u  (t)+u  (t+T  ) 

.  / .  n  A  c  c  s 

“i(t)  2 

.  u  (t)-u  (t+T  ) 

„.{+\  k  _S - S - ■  _ 

U0u;  2 


(76a) 


(76b) 


Normalized  versions  of  the  decision  statistics  given  by  (75)  computed  dur¬ 
ing  any  one  signaling  interval,  corresponding  to  those  given  by  (67)  for 
BFSK,  are  then* 

t  We  assume  that  the  preceding  baud  symbol  is  known  so  that  the  possibility 
of  paired  errors  is  neglected  in  this  formulation. 
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‘i  ’  (^TT  '3*?i  *  ci 


(77a) 


(77b) 


Here,  the  quantities  w^,  k=0,l  are  Gaussian  noise  components  with  identical 
interpretation  as  in  (67)  and  the  sequel.  Likewise,  the  components  y  ,  k=0,l 

A 


are  due  to  the  shot  noise  where  now  y.’  =y,  -Jy,  with 

k  k,c  k,s 


where  now 


while 


R(2T  ) 

yk,c  =  .£  S  zi,cgk(ti)  ;  kss0>1 


R(2T  ) 
r  s 


,1  (-  )  £  _i_ 

ilUj  2T 

e 


h_(T-t)dx 


s£(t)  *  j  hQ(T-t)dT-  h0(T-t)dx|  . 


(76a) 


(78b) 


(79a) 


(79b) 


The  corresponding  random  amplitudes  in  this  case  are 


Z’  =  2(T  /K.)Vcos(9.+i;A)  , 

i  ,c  s  o  l  i 


Z’  e  =  2(2T  /K.rP.sin(6 ,+-A)  . 

l  ,s  s  0  i  i 


(60a) 


(80b) 


A  few  comments  are  in  order  concerning  the  form  of  the  normalized  de¬ 
cision  statistics  in  (67)  and  (77)  for  noncoherent  BFSK  and  DPSK,  respect¬ 
ively.  In  the  absence  of  the  shot  noise  component,  it  is  clear  that  DPSK 
offers  a  3dB  advantage  over  BFSK  which  is  a  well-known  result  for  the  AWGN 
channel.  In  the  presence  of  shot  noise,  however,  the  performance  can  be 


decidely  different  due  to  the  distinct  difference  in  the  corresponding  shot 


noise  contributions  y^  end  y^.  Furthermore ,  although  the  shot  noise  com- 
ponenns  in  the  two  natched  filter  outputs  are  uncorrelated  in  the  sense 
that  yj}  =  E{ygy^*}=  0,  it  can  not  be  inferred  that  they  are  therefore 
statistically  independent.  This  severely  complicates  the  analysis  of  error 
probability. 

Based  upon  the  preceding,  a  unified  approach  to  the  evaluation  of  bit 
error  probability  can  be  formulated.  Specifically,  assume  that  a  "0"  has 
been  transmitted  (x=-l),  then  an  error  is  made  with  BFSK  if  d^= |§q | < |d^ |=d^ 
so  that 


Pe  =  FrU0<V 


Similar  expressions  hold  for  DPSK  with  d^=|d£|  k*0,l  replacing  the  corres¬ 
ponding  unprimed  quantities.  In  what  follows  we  consider  BFSK  exclusively 
and  merely  state  the  corresponding  results  for  DPSK. 

Clearly,  the  error  probability  can  be  determined  as 


CO  ,0O  .00  .00 


nr{Vailyo,cr0,s-lrl,c'yl,s)f(y0,c-:),0,s'5rl,c-3rl>S) 


—  CC_  C0_  00_C0 


4y0,cay0,.4yl,c'iyl,s  •  (82> 

where  Pr{d  <d- |yn  ,yn  ,y,  ,y  }  is  the  conditional  error  probability  given 

yO,c’yO,s’yl,c’yl,s  *  or  equivalently  ?Q  and  *r  and  f(y0>c.y0.s’yl,c>yl,s) 
is  the  corresponding  Joint  p.d.f.  This  expression  then  provides  a  parallel 
to  (52)  for  coherent  modulation.  Indeed,  following  the  development  in  [l8], 
the  conditional  error  probability  is  easily  shown  to  be  given  by 

Pria0<d1|yo,c.yo,s-yi1c'yi,s,=  |  •  (83) 


where 


Q(a,b)=[  xexpj-  — *X— lljax)dx 


(8M 


is  Mar cue’s  Q- function  l19j  and  ~ci’)  is  the  modified  Bessel  function  of 
the  first  kind  of  order  zero.  The  quantities  a  and  b  in  (83)  are  defined 


according  to 


a  =  |(y?_c-y2jS)  , 


b  *  n(yo,c*-^5  cosS)2+(y0,s t/5 sin6)2} 


for  BFSK  vhile 


a  «  |(yi^c+yi%)  .  (86a 

and  _  _ 

b  =  i{(yo,c+Ji^  cosS)2+(y6,s+i^ sin6>2}  *  <86t 

for  DPSKT.  As  in  the  evaluation  of  P  through  (52)  for  coherent  modulation. 

e 

use  of  (82)  for  noncoherent  modulation  is  somewhat  impractical  due  to  the 
general  inability  of  obtaining  concise  expressions  for  the  joint  p.d.f. 

f<y0,c*3r01»*3rl,c»3rl,s)* 

In  Appendix  D  ve  show  that  the  error  probability  can  alternatively  be 


written  in  terms  of  ch.f.'s  according  to 


P  =i  +  i- 
e  2  2  t 


''dp  ,di  (-v,v]x=-l) 


dv  , 


where  ^(vQ,v^ |x=-l)  is  the  conditional  joint  ch.f.  of  the  decision  vari¬ 
ables  dQ  and  d^  given  x=-l  was  transmitted.  This  result  is  then  analogous 
to  the  development  in  Appendix  C  leading  to  (5*0  for  coherent  modulation. 
Unfortunately,  for  the  situation  considered  here  the  evaluation  of  the  joint 


c,y6,s,yi,c,y: 


ch.f.  ^(v^.y^),  although  possible  in  some  special  cases,  becomes  un¬ 
wieldy  in  general"*".  As  a  result  we  resort  exclusively  xo  simulation  results 
in  vhat  follows. 

In  Fig.  33  we  illustrate  the  performance  of  BFSK  for  the  case  of  Poisson 

arrivals  and  power-Rayleigh  distributed  pulse  amplitudes  with  a=1.0.  The 

characteristic  pulse  shape  is  h.(t)=e  u  ,(t)  with  aT  =1.0  and  y2T  =20d3. 

u  -l  s  s 

Observe  the  convergence  to  the  equivalent  all-Gaussian  performance  with 

increasing  XTg.  Similar  results  are  illustrated  in  Fig.  3l»  for  DPSK.  Note  the 

3dB  performance  advantage  of  DPSK  vis-a-vis  BFSK  for  large  XT  when  the  total 

s 

noise  environment  is  close  to  Gaussian.  For  smaller  XT^,  or  more  impulsive 
noise  situations,  this  performance  advantage  can  be  considerably  reduced. 

In  Fig.'s  35  and  36  we  illustrate  the  performance  of  both  BFSK  and  DPSK  as 
a  function  of  Y2Tg  for  selected  parameter  choices.  Finally,  in  Fig.'s  37 
and  38  we  illustrate  the  error  probability  behavior  as  a  function  of  aTg. 

As  in  previous  work,  little  sensitivity  was  observed  for  either  BFSK  or  DPSK 
to  characteristic  pulse  shape,  interarrival  statistics,  or  pulse  amplitude 
distribution  provided  the  /PD's  were  matched  as  in  Fig.  11. 

V.  Nonlinear  Receiver  Performance: 

The  preceding  section  has  demonstrated  the  often  catastrophic  effect  of 

impulse  noise  on  linear  matched  filter  reception.  Clearly,  some  form  of 

nonlinear  front-end  processing  of  the  limiting  or  saturating  variety  should 

provide  some  protection  against  the  large  pulse  amplitudes  -which,  although 

occurring  relatively  infrequently,  contain  a  large  portion  of  the  overall 

signal  energy.  This  point  has  been  argued  previously  [20]-[25],  although 

to  date,  little  progress  has  been  made  ir.  developing  a  general  approach. 

t  IJcte  that  part  of  the  difficulty  is  due  to  the  fact  that  d^d^  are  not 
necessarily  statistically  independent. 
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The  alternative  is  to  propose  a  class  of  nonlinear  characteristics  and  ex¬ 
plore  the  resulting  performance  as  a  function  of  various  parameter  choices 
defining  these  nonlinearities.  Unfortunately,  even  here  a  comprehensive 
treatment  has  not  yet  been  provided. 

In  what  follows  we  assume  nonlinear  predetection  processing  preceding 
a  conventional  matched  filter  data  demodulator  as  illustrated  in  Fig.  39. 
Specifically,  we  assume  a  configuration  consisting  of  a  zero-memory  nonlinear 
(ZMNL)  characteristic  sandwiched  between  two  narrowband  filters  possessing 
equivalent  rectangular  bandwidths  and  Bg  respectively.  Actually,  provided 
BIT  ^1  the  second  filter  has  little  effect  on  performance  followed  as  it  is 
by  the  narrowband  matched  filter.  Thus,  performance  will  be  determined  chiefly 

by  the  first  filter  preceding  the  ZMNL  characteristic.  The  quantity  B’T  then 

1  s 

plays  the  same  role  as  in  the  fictitious  measurement  filter  used  in  describ¬ 
ing  the  APD's  and  ALCR's. 

The  class  of  ZMNL  characteristics  to  be  considered  can  be  implemented  as 
illustrated  in  Fig.  I4O  where  g(  • )  is  in  general  some  linear  function  of  the 
signal  envelope  R(t)=|?(t)|.  For  example,  with  s(R)=l/R,  the  result  can  easily 
be  seen  to  be  a  narrowband  hard-limiter  with  zero  phase  distortion.  The  chief 
motivation  for  restriction  to  nonlinear  processing  structures  of  this  form 
lies  in  some  related  work  [26]  where  it  is  shown  that,  under  the  assumption 
of  independent  noise  samples  and  weak  or  threshold  signals,  the  optimum  ZMNL 
characteristic  is  of  the  form"*" 

<WE)  *  '  W®1  •  <86> 

where  f(R)=f  (R)/2nR  with  f  (•)  the  envelope  n.d.f.  of  the  filtered  imnulse 
a  a 

t  In  [26]  the  characteristic  defined  by  (87)  is  referred  to  as  "locally  opti¬ 
mum",  and  hence  the  subscript  on  g„Q(R). 
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when  the  pulse  amplitude  distribution  is  pover-Rayleigh  distributed  vith  ct= 

1.0  and  hQ(t)=e  u  1(t).  The  joint  ch.f.  is  then  given  by  (Uo).  In  Fig. 

kl  we  illustrate  the  behavior  of  a^g^R)  versus  the  envelope  normalized 

to  its  rms  level  for  selected  values  of  y2T  and  vith  B'T  =l6.  Here  the 

s  Is 

quantity  a2,  is  the  common  variance  of  the  I/Q  components  of  the  filtered 

noise  process  fi'(t).  As  can  be  seen  the  effect  of  increasing  y2Tg  is  to 

severely  limit  and  hence  de-emphasize  the  large  noise  peaks.  In  Fig.  U2  we 

demonstrate  the  behavior  for  various  values  of  vith  y2Ts=2CdB.  Again 

for  fixed  y2T  increasing  B'T  ,  assuming  B'>3,  allows  additional  Gaussian 

noise  so  that  the  ZMNL  characteristics  are  merely  shifted  to  the  right  in 

accordance  with  the  parameter  t.  defined  in  (1*3)  vith  B,’T  renlacing  B'T  . 

l  is*  s 

Finally,  in  Fig.'s  1*3  and  we  illustrate  the  effect  of  aTs  and  XTg,  re¬ 
spectively.  Note,  in  particular,  that  for  X/a>>!  the  locally  optimum  ZMNL 


+  Actually  in  Fig.'s  Ll-Ul*  we  plot  the  cuantity  o2,g^0(R)  normalized  to 
its  maximum  value  at  R=0,  versus  ‘  ” 
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approaches  the  constant  value  1.0  which  is  optimum  for  Gaussian  noise. 

For  more  severe  noise  environments  (i.e.,  smaller  a)  the  optimum  ZMNL  char¬ 
acteristics  exhibit  more  pronounced  limiting  behavior  and  less  saturations 
at  large  envelope  values. 

While  these  results  are  useful  in  understanding  the  behavior  of  the 
locally  optimum  ZMNL  characteristic,  it  remains  to  approximate  this  behavior 
by  a  practical  class  of  ZMNL  characteristics  which  do  not  require  a  priori 
knowledge  of  noise  statistics  since  very  rarely  would  they  be  known  in  a 
realistic  data  communications  application.  Consequently,  we  restrict  atten¬ 
tion  to  the  ZMNL  characteristics  listed  in  Table  3  which  more  or  less  approx¬ 
imate  the  behavior  of  the  locally  optimum  characteristic.  ; 

! 

In  Fig.  1*5  we  illustrate  typical  simulated  behavior  using  coherent  BPSK 

modulation  and  selected  characteristics  from  Table  3.  The  pulse  arrivals  are 

Poisson  and  y2T  =20dB  with  B'T  =16.  Other  pertinent  parameters  are  illustra- 
S  X  s 

I 

ted  in  Fig.  1+5.  The  predetection  filter  in  all  cases  possesses  a  second-order  , 

Butterworth  characteristic  with  prescribed  equivalent  rectangular  bandwidth. 

Also,  the  appropriate  clip  levels  corresponding  to  ZMNL  characteristics  in 
Table  3  were  arbitrarily  adjusted  to  OdB  above  the  received  signal  level. 

While  the  choice  of  clip  level  is  obviously  an  important  parameter,  we  have 
not  attempted  to  optimize  this  quantity.  The  justification  for  the  choice 
made  here  is  that  it  clearly  eliminates  most  of  the  large  amplitude  peaks 
while  preserving  the  signal  structure.  In  all  subsequent  work  the  clip  levels 
have  both  held  fixed  at  this  value. 

As  evidenced  in  Fig.  1*5,  the  ZMNL  predetection  processing  can  be  quite 
effective  in  combatting  the  effects  of  impulse  noise.  For  example,  the  alge¬ 
braic  nonlinearity  with  n=U  provides  an  improvement  over  the  linear  receiver  of 
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roughly  9dB  at  Pg=5xl0  .  The  improvement  using  the  other  nonlinear  receiv¬ 
er  structures  is  only  marginal,  although  this  can  be  improved  considerably  in 
more  intensive  noise  environments  and/or  through  choice  of  predetection  band¬ 
width.  This  is  illustrated  in  Fig.  46  where  the  error  probability  for  a 

hard-limiting  receiver  is  shown  for  selected  choices  of  B'T  in  a  noise 

X  s 

environment  characterized  by  aT  =20.0  and  y2T  =40dB  representing  severe 

s  s 

impulse  noise.  Small  values  of  B'T  not  only  result  in  intersymbol  inter- 

X  s 

ference  (ISI)  effects  which  degrades  performance,  but  also  disperses  or 
lengthens  the  characteristic  pulse  shape  making  the  hard-limiter  less  effec¬ 
tive  in  suppressing  the  large  noise  peaks.  For  example,  even  for  B’T  =16 
the  impulse  response  of  the  filter  is  approximately  3  times  the  character¬ 
istic  pulse  shape  for  aTs=20.  The  characteristic  noise  pulse  at  the  output 
of  the  filter  is  then  lengthened  by  roughly  a  factor  of  three.  The  pulse 
duration  is  then  crudely  approximated  by  an  "effective"  a'Ts=20/3=7.  In 
Fig.  47  we  illustrate*  Pe  behavior  for  a  hard-limiting  receiver  as  a  function 

of  aT  from  which  it  is  clear  that  performance  degrades  rapidly  for  decreas- 
s 

ing  values  of  aT^,  and  hence  less  narrowly  confined  characteristic  pulse 

shape.  The  conclusion  is  that  the  predetection  bandwidth,  at  least**  for 

aT  >>1,  should  be  made  as  large  as  possible.  For  smaller  aT  ,  the  predetec- 

tion  bandwidth  requirements  are  much  less  severe.  This  is  illustrated  to  an 

extent  in  Fig.  48  where  we  illustrate  results  for  a  hard-limiting  receiver 

both  without  a  filter  and  with  a  predetection  bandwidth  B|Tg=8.  For  the 

smaller  aT  values  there  is  little  difference  between  the  two  results  as  was 
s 

to  be  expected. 

t  Corresponding  results  for  a  linear  receiver  are  illustrated  in  Fig.  26. 
ft  And,  of  course,  the  simultaneous  constraint  A/a<<l. 


As  an  illustration  of  the  performance  gains  to  be  realized  by  nonlinear 
processing  and  appropriate  predetection  filtering,  the  corresponding  results 
in  Fig.'s  26  and  U7  for  aTg=20  indicate  a  substantial  improvement  of  almost 

_3 

20dB  at  P  =10  .  In  Fig.  1*9  we  illustrate  the  P  behavior  of  a  hard-limit- 

e  e 

ing  receiver  as  a  function  of  Y2Tg  with  a  fixed,  although  suboptimum,  prede¬ 
tection  bandwidth  of  B|T  =16.  Wider  predetection  bandwidths  should  improve 
performance  in  light  of  previous  comments,  but  this  is  the  widest  bandwidth 
that  can  be  conveniently  simulated  in  the  ICS.  Nevertheless,  the  results  are- 
useful  in  demonstrating  the  dependence  upon  Y2Tg.  Similar  results  for  a  re¬ 
ceiver  employing  a  U'th  order  algebraic  nonlinearity  are  illustrated  in  Fig. 50 
VI.  Summary  and  Conclusions: 

We  have  provided  a  detailed  description  of  a  useful  and  conveniently 
parameterized  model  for  impulsive  noise.  Some  of  its  low-order  statistical 
properties  have  been  investigated  both  analytically  and  through  simulation 
Similarly,  we  have  described  the  error  probability  performance  of  a  variety 
of  linear  and  nonlinear  receiver  structures  in  noise  environments  as  describ¬ 
ed  by  this  model.  The  rationale  underlying  the  use  of  nonlinear  predetect 
ion  processing  was  described  and  the  performance  advantages  of  such  process¬ 
ing  illustrated. 

While  these  results  should  prove  useful  in  the  design  and  development 
of  digital  communication  systems  operating  in  impulsive  noise,  much  more  work 
remains.  Specifically,  it  is  believed  that  analytical  methods  although  ted¬ 
ious,  can  be  carried  much  further  than  described  here.  This  is  particularly 
true  for  linear  receiver  performance  although  somewhat  questionable  for  non¬ 
linear  processing  structures.  At  any  rate,  more  comprehensive  results,  either 
analytical  or  through  simulation,  are  required  to  obtain  a  complete  picture 


of  nonlinear  receiver  performance  in  impulse  noise  as  described  here. 


Evaluation  of  Characteristic  Function 
of  Shot  Noise  Component  for  Poisson  Arrivals 

We  are  interested  in  the  evaluation  of  the  joint  ch.f.  of  I/Q  compon¬ 


ents  of  the  shot  noise  component  given  by 

K  v  (vn»vo)  =  E{exp[j(v  y  (t)+v  y  (t)]} 
yc,yg  !  2  1  c  2  2 


(A-l) 


« v  t ; 

=  E{exp[j/2  P_h0(t-ti){v1cos(6i+'rr/l4)+v2sin(ei+'iT/li)}]}j 
where  we  have  made  use  of  (23)  and  the  sequel.  This  last  expression  can  then 


be  written  as 


where 


v  (vi»v2)  =  E<exp[ l  z  h  (t-t  )]}  , 
Jc,Js  i=l 


z±  =  Ricos(ei+7r/U-(+) ,  i=l ,2 , .  . .  , 


(A-2) 


(A-3) 


with  4>  =  tan1  v2/v  .  But  the  right-hand  side  of  (A-2)  is  merely  the  ch.f. 

of  a  baseband  filtered  shot  noise  component  with  random  amplitude  S2£z^  and 

impulse  response  hQ(t-ti).  While  this  is  difficult  to  evaluate  in  general, 

the  result  is  well  known  in  the  case  where  {N(t),  t>0}  is  Poisson.  In 

particular,  in  this  case  we  have  (cf.  [6]) 

.00 

K  v  (vi’vc>)  =  ex?{x  U  (t^Sh  (x))-l]dT}  ,  (A-M 

1v  0 

where  ij;  (v)=E{eJ  z)  is  the  common  ch.f.  of  the  i.i.d.  sequence  defined 
z 

according  to  (A-3).  Specifically,  we  have 

00 

*z(^Bh0(T))  =  |  [ij  p^8K,o(T)",s<6+’/“'-*,ae]fr(ii)di1 

0  -n 

=  f  J_( t/ISRh-l t ) )f  (R)dP  , 


(A-5) 


vhere  ve  make  use  of  the  definition  of  the  Bessel  function  of  the  firs' 


kind  of  order  zero  defined  according  to 


V*>  *  2? 


eJ«o.(8-c)a6 


— 7T 


(A-6) 


It  follows  then  that 


y  ^vi*V2^=exp^X[  JQ(  »^6HIi0(t  )  }-l]dT>  , 


(A-7 ) 


c  s 


vhere  E  {•}  indicates  expectation  with  respect  to  the  envelope  distribution 
r 

according  to  the  last  term  on  the  right-hand  side  of  ( A—  5 ) • 
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APPENDIX  B 


Alternative  Error  Probability  Expression  for  Coherent  Modulation 

The  error  probability  for  linear  matched  filter  reception  employing 


coherent  modulation  can  be  expressed  in  the  form 


Pg  =  pQPr{d>_o|x=-l}+p1Pr{d<o|x=l}  , 


(B-l) 


where  d  is  the  decision  variable  given  by  (i*8)  and  Pq>P^  are  the  prior 
probabilities  of  x=-l  and  x=l,  respectively.  Under  equiprobable  signaling 
we  clearly  have 


P  =Pr{d>o|x=-l} 

"  r° 

=1-  f  (C|x=-l)d5  . 


(B-2) 


In  terms  of  ch.f.'s  it  is  easily  shown  that 


fd( £ |x=-l)d£=  u_1(-?)fd(5 |x=-l)d£ 


Si  +  L[ 

2  2  it 


“  P(v|x=-l) 


(B-3) 


where  <k(v|x=-1)  is  the  conditional  ch.f.  of  d  given  x=-l.  Recall  also 
a 


4/d(v|x=-l)=^y(v)exp' -j v-  -  v  •  , 


(B-U) 


where  ^  (v)  is  the  ch.f.  of  the  shot  component.  Thus, 


t„(v) 


p  .ill  Ip  vL-%  dv 

e  2  2n  j  jv  Nq 


1  if",  sin^VW0V  -v/2dv 
=  2  +  7  Vv)  - V - 6 


(B-5) 


which  is  readily  evaluated  by  quadrature  given  the  ch.f.  <l>  (v)  of  the  shot 


noise  component. 
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APPENDIX  C 


Evaluation  of  Characteristic  Function  of  Shot  Noise 
Component  at  Output  of  Linear  Matched  Filter  in  Special  Case 


We  evaluate  the  ch.f.  i p  (v),  given  by  (55),  in  the  special  case  of 

y 

coherent  BPSK  modulation  vhen  the  input  pulse  amplitude  distribution  is 
pover-Rayleigh  vith  characteristic  exponent  a=1.0,  and  the  filter  generating 
the  shot  noise  is  described  by  hQ(t)=e  u  ^(t).  In  this  case  ve  find 
from  (51)  that  for  t<Tg 

g(t)  =  S  e**a(T-t ,  (C-l) 

S  max(0,t) 


g(t)  =  ]aTs 


|-ll-e"a(Ts_t)];  0<t<T 


^[l-e-aTsl  j  t<0  , 

s 


(C-2) 


vhereupon  substitution  into  (56)  yields  ij;  ( • ) .  At  this  point  it  is  conven¬ 
ient  to  express  i}^(v)  in  the  form 

♦  (v)  =  v"(v)/(v)  ( C— 3 ) 

<J  v 


where 


(c-M 


has  an  interpretation  as  the  ch.f.  associated  vith  contributions  due  to  past 

impulses,  and  ^ 

'J'y(v)  =  expjx  [^z,^vg( t  )  )-l]dT^-  ,  (C-5) 

0 

which  is  the  ch.f.  associated  with  impulses  occurring  during  the  present  sig¬ 
naling  interval.  We  will  be  particularly  interested  in  evaluation  of  the 
integrals  appearing  in  these  two  expressions. 


4-43 


APPENDIX  I 


Alternative  Error  Probability  Expression 
for  Noncoherent  Modulation 


The  error  probability  for  matched  filter  reception  employing  noncoherent 
modulation  and  equiprobable  signaling  can  be  expressed  in  the  generic  form 

Pe  =  PrVdlix=-13 

.  fd0,di{CO,5llX="i)dCOdCl 


0  0 
r“  r 


u-l^l“^03fd0,d1^C’^l  -l3d^0d^l  ’  (B-l) 


_  CO  .CO 


vhere  dp,  dj  are  decision  variables  appropriate  for  either  BFSK  or  DPSK.  Now 
from  Parseval’s  theorem  this  last  expression  leads  to 


P  =  yz 
e  Ijtt' 


U(v0»vi)%,d1(v0’vilx=-1)dv0dvi  •  (D“2) 


_oo  _00 


vhere  iK  j(v.,v.  |x=-l)  is  the  joint  ch.f.  of  the  decision  variables  d-.cL 

ap ,  aj  0  l  0  1 

given  that  x=-l ,  and 


U(v0’V  = 


u.1(Cl-C0)e‘,i(5oWl)d?1d50  , 


(D-3) 


is  the  Fourier  transform  of  u  ^(Ci-Co)'  Clearly , under  a  change  of  variable 

U(VV  =  {  u.1(T)e”JVlTdT}e"'!( V0+V1)C0  d^0 

05  00 

=  [it6(v1)  +  j^-][2tt5(v0+v1)]  (D-U ) 

2tt6(v  +v  ) 

=  2h26(v0)6(v1)  +  - — -  ♦ 

where  we  have  made  use  of  obvious  transform  pairs.  Substitution  into  (D-2) 
yields 


p  =  —  +  — 

e  2  2n 


.  {-v,v  x=-l) 

-  dv  . 

Jv 


(D-5) 
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Comparison  of  Computed  and  Simulated  APD  Results  for 
Special  Case  of  Pover-Rayleigh  Pulse  Amplitude  Distri¬ 
bution  with  Characteristic  Exponent  a  =1.0. 
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Figure  5 

Typical  APD  Behavior  as  a  Function  of  y2T  for  Selected  Parameter  Choices 
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Figure  9 

Typical  APD  Behavior  as  a  Function  of  Characteristic 
Exponent  a  for  Selected  Parameter  Choices 
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Figure  10 

Typical  APD  Behavior  as  a  Function  of 
for  Selected  Parameter  Choices. 
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Figure  11 

Comparison  of  APD  Behavior  for  Pover-Rayleigh 
and  Lognormal  Distributed  Pulse  Amplitudes 
under  Selected  Parameter  Choices. 
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Typical  APD  Behavior  as  a  Function  of  Pulse  Shape  h^t) 
for  Selected  Parameter  Choices. 
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Figure  13 

Typical  APD  Behavior  as  a  Function  of  v 
for  Selected  Parameter  Choices. 
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Figure  19 

Typical  ALCR  Behavior  as  a  Function',  f  Charact¬ 
eristic  Exponent  a  for  Selected  Parameter  Choices 
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Typical  ALCR  Behavior  as  a  Function  of  for 
Selected  Parameter  Choices. 
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Figure  21 

Comparison  of  ALCR  Behavior  for  Power- 
Rayleigh  and  Lognormal  Distributed  Pulse 
Amplitudes  under  Selected  Parameter  Choices 
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Figure  22 

Typical  ALCR  Behavior  as  a  Function  of  Pulse 
Shape  hg(t)  for  Selected  Parameter  Choices. 
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Figure  26 

Typical  P  Behavior  for  Linear  Matched  Filter 
Reception6as  a  Function  of  aT  for  Selected 
Parameter  Choices:  Coherent  B?SK  Modulation. 
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Figure  28 

Typical  P  Behavior  for  Linear  Matched  Filter  Recep¬ 
tion  as  arunction  of  V  for  Selected  Parameter  Choices; 
Coherent  BPSK  Modulation. 
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Figure  29 

Comparison  of  P  Behavior  for  Linear  Matched  Filter 
Reception  with  ?ower-Rayleigh  and  Lognormal  Distri¬ 
buted  Pulse  Amplitudes  under  Selected  Parameter 
Choices;  Coherent  BPSK  Modulation. 
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Typical  P  Behavior  for  Linear  Matched  Filter 
Reception  as  a  Function  of  Pulse  Shape  h-.(t) 
for  Selected  Parameter  Choices;  Coherent UBPSK 
Modulation. 
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Figure  36 

Typical  P  Behavior  for  Matched  Filter  Re¬ 
ception  as  a  Function  of  y2T  for  Selected 
Parameter  Choices:  Noncoherent  DPSK  Modulation. 
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Typical  P  Behavior  for  Matched  Filter  Recep¬ 
tion  as  a  Function  of  aT  for  selected  Para¬ 
meter  Choices;  Noncoherents  BFSK  Modulation. 
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Typical  P  Behavior  for  Matched  Filter  Recep¬ 
tion  as  aeFunction  of  aT  for  Selected  Para- 
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Figure  1*1 

Computed  Locally  Optimum  ZMNL  Receiver  Charact 
eristic  as  a  Function  of  y2T  for  Selected 
Parameter  Choices. 
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Typical  Pg  Behavior  for  Selected  Linear  and 

Nonlinear  Receiver  Structures ;  Coherent  BPSK 
Modulation;  Clip  Levels  OdB  Above  Signal  Level 
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Typical  P  Behavior  as  a  Function  of  y2T  for 
Selected  Parameter  Choices;  Coherent  BPS& 
Modulation;  Hard-Limiter  Receiver. 
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INCORPORATION  OF  ADAPTIVE  ARRAY  PROCESSING  CAPABILITIES 


INTO  THE  INTERACTIVE  COMMUNICATION  SIMULATOR 


I .  Introduction : 


r  e  analysis  and  design  of  digital  communication  systems  has  historically 
focused  upon  strictly  temporal  processing.  In  an  increasing  number  of  import¬ 
ant  applications,  however,  it  is  clear  that  some  form  of  spatial  processing 
is  desirable.  The  spatial  processing  may  be  used  to  steer  a  beam  in  the 
direction  of  the  desired  transmitter  or  to  place  a  null  in  the  direction  of 
an  interfering  transmitter.  Here  the  interference  cay  be  deliberate  elect¬ 
ronic  countermeasures  (ECM)  or  jamming, or  unintentional  r&dio-frecuency  inter¬ 
ference  (RFI). 

A  number  of  systems  have  been  developed  or  proposed  which  utilize  seme 
form  of  spatial  processing  in  addition  to  the  conventional  oecporal  process¬ 
ing  associated  with  typical  digital  communication  receivers.  However,  there 
has  been  no  systematic  and  comprehensive  study  of  the  subtle  interactions 
between  spatial  and  temporal  processing  structures  in  an  attempt  to  identify 
the  tradeoffs  and  develop  an  optimum  basis  for  rational  design  of  digital 
communication  systems  subject  to  spatially  distributed  interference.  Part 
of  the  difficulty  lies  in  the  intractability  of  a  strictly  analytical  approach. 
Specifically,  although  optimum  spatial/temporal  receiver  szructures  can  be 
determined  under  precise,  and  sometimes  restrictive,  modeling  assumptions, 
the  error  probability  performance  of  these  structures  is  difficult  to  analyze. 
Furthermore,  the  effects  of  deviations  from  modeling  assumptions  is  difficult 
to  establish.  Finally,  the  performance  evaluation  of  suboptimum,  and  presumably 
less  complex  and  more  robust,  processing  structures  is  often  hopelessly  com¬ 
plicated.  As  a  result,  some  form  of  simulation  facility  is  required  as  an 
adjunct  to  ongoing  analytical  investigations. 
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In  the  present  report  ve  provide  an  explicit  description  cf  how 
adaptive  array  spatial  processing  structures  are  to  he  incorporated  into 
the  existing  temporal  processing  capabilities  of  the  HP I  Interactive  Communi¬ 
cations  Simulator  (ICS).  The  ICS  [l]  is  an  extensive  hardware /software  system 
which  has  been  configured  to  allow  realistic  simulation  of  a  variety  of  mili¬ 
tary,  space,  and  commercial  point-to-point  communication  links  with  provision 
for  flexible  future  expansion.  The  important  system  design  considerations 
in  developing  this  simulation  capability  is  discussed  and  the  choice  of 
important  parameters  is  described.  Also  described  is  the  manner  in  which 
this  capability  is  to  be  utilized  in  ongoing  research  cn  digital  communi¬ 
cations  in  spatially  distributed  interference  channels.  It  is  expected  that 
this  additional  spatial  processing  capability  will  significantly  enhance  the 
usefulness  of  the  ICS  as  a  simulation  tool. 

In  Section  II  we  describe  the  channel  signal  and  r.cise  modeling  assump¬ 
tions.  This  is  followed  by  a  description  of  adaptive  array  processing 
structures  and  their  incorporation  into  the  ICS  in  Section  III.  In  Section 
I\  we  discuss  some  future  simulation  studies  to  be  undertaken.  Finally,  in 
Section  V  we  provide  a  summary  and  conclusions. 

II.  Preliminaries : 

In  what  follows  we  will  make  extensive  use  of  complex  narrowband  repre¬ 
sentations  of  all  signal  and  noise  processes.  Specifically,  the  received 
signal  at  the  output  of  the  i'th  element  is  given  by 

r.(t)  =  /2  Re{f.(t)eJ27Tfct}  ;  i=l,2,...,  !I  ,  (l) 

where  f  is  an  assumed  known  carrier  frequency  in  Hz,  and  r.(t)=r  .  (t)-.1r  .(t) 
c  -  1  ci  si 

is  the  corresponding  complex  envelope  expressed  in  terms  of  lowpass  waveforms 
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r  .  (t)  and  r  .  (t)  representing  the  inphase  and  quadrature  '.Z/S)  cormonents 
respectively.  The  complex  envelope  is  furthermore  assumed  of  the  form 

?i(t)=§Oi(t)+Vt)+*i(t)i  (2) 

vhere  sQi(t)  is  the  complex  envelope  of  the  received  signal  component  at 

the  output  of  the  i'th  element,  ?0i(t)  is  the  complex  envelope  of  the  corres¬ 
ponding  interference  component,  and  *^(t)  is  a  complex  zero-mean  additive 

vhite  Gaussian  noise  (AWGN)  component.  Specifically,  v_.  (t)=v  (t)-Jw  .(t) 

i  Ci  si 

vith  vs^ (t )  mutually  independent  WGN  processes  each  possessing  double¬ 

sided  noise  spectral  density  N^/2  vatts/Hz. 

The  signal  component  §oi(t)  is  assumed  of  the  form 


3Qi(t)  *  I  S(T)hi(t,T)dT  ;  i»l,2,...  ,3J, 


where  §(t)  is  the  transmitted  signal  as  it  would  be  received  at  the  output  of 

an  ideal  isotropic  sensor+  located  at  the  origin  of  spatial  coordinates  ana 

excited  only  by  the  incident  signal  field.  The  quantity  b^t,?)  represents 

the  complex  spatial /temporal  impulse  response  of  the  i'th  sensor  where  it  is 

implicitly  assumed  that  the  transmitter/sensor  geometry  is  known  a  priori. 

Similarly,  we  assume  that 

J  rw 


?0i(t)  «  I  j  ?J{t)hiJ(t,x)dT  •,  i-1, 2, (li) 

<5  _CO 

where  y  (t)  represents  the  output  of  an  ideal  isotropic  sensor  to  the  j'th 
interference  source,  j=l,2,...,J  and  represents  the  complex  impulse 

response  of  the  i'th  sensor  to  the  j'th  interference  source.  Again  we  assume 
the  quantities  h^^Ctjx)  ,  i=l,2,...,N;  J=1,2,...,J  can  be  described,  if  only 

in  a  statistical  sense,  given  the  inter ferer /sensor  geometry. 


t  An  ideal  isotropic  sensor  is  defined  as  a  sensor  having  unit  response  to 
signals  incident  from  all  directions  and  all  frequencies. 


While  the  complex  spatial/temporal  impulse  responses  h.  (t,t),  i=l,2, 

^ » J 

•  •  •  »N, ,J=0, 1,2,  —  ,J  may  be  random  as  well  as  time  varying^  we  will  be  spec¬ 
ifically  interested  in  the  case  where  these  quantities  are  deterministic 
time-invariant  quantities  given  the  transmitter/interferer/sensor  geometry. 

That  is,  jij(t,T)=h^j(t-r).  For  example,  in  the  case  of  a  pur  time  delay 
of  seconds  we  have  h^  (t  )=6(t-xi  )e"J2irfcTij .  The  delta  function  accounts 
for  the  group  delay  associated  with  the  complex  envelope  while  the  complex 
exponential  accounts  for  the  corresponding  phase  shift  of  the  carrier. 

It  follows  from  (2)  that 

S0i(t)={sc(t-Ti0)cOs2irfcTi0-Ss(t-Ti0)sin27rfcTi0} 

-J{ss(t-TiO)cos2,rfcTiO+Sc(t-TiO)sin2lTfcTiO}  •  (5) 

If  Bq  represents  the  bandwidth++  of  the  signal  component  s(t)  then  for  the  cases 
of  interest  in  what  follows  it  will  always  be  assumed  that  BQTi0«l,  i=l,2,..., 
N.  It  follows  that  3(t-tiQ):g(t)  so  that 

S0i(t)=:{sc(t)cOs2lTfcTi0'Ss(t)sin2llfcTi0} 

-j{ss(t)cos2nfcTi0+sc(t)sin2TTfcTi0}  ,  (6) 

and  the  terms  within  braces  represent  the  corresponding  I/Q  components,  re¬ 
spectively.  Similarly,  the  interfering  signal  component  in  (U)  can  be  ex¬ 


pressed  as 


?0i(t)  =  J  ■<Wt,COS2’VirysJ(t)Si'*,'fcTi,I> 


-J{ysJ(t)cos2TrfcTiJ+ycJ(t)sin2nfcTiJ}]  ,  (7) 

where  the  terms  in  braces  again  represent  the  corresponding  I/Q  components. 
Thus,  realization  of  the  signal  and  noise  models  merely  require  specification 


+  For  notational  convenience  we  represent  h.(t,i)  as  h.n(t,x)  allowing  concise 
treatment  of  all  complex  impulse  response1 functions. 1 

ft  Actually,  in  what  follows  we  assume  that  B-  is  the  double-sided  equivalent 
rectangular  bandwidth  in  Hz.  ° 


of  the  signal  complex  envelope  §(t),  the  statistics  of  the  interfering 
signals  ^(t),  J=1,2,...,J  and  the  signal/interferer/sensor  geometry  as 
embodied  by  the  delays  ,  i=l,2,...,N,  .3=0,1, 2, . . .  ,J. 

We  assume  that  the  signals  of  interest  are  described  as  in  [l].  Spec¬ 
ifically,  we  restrict  attention  to  digital  signalling  formats  such  that 
§(t)  has  the  representation 

i(t)  j/5  I  tCijilic(t-SkiTj-,)*JC2il-Js(t-ATs-T)]e'>e  •  (6) 

Here,  depending  upon  the  modulation  strategy  employed,  u  (t)  and  u  (t)  are 

c  s 

appropriately  defined  baseband  waveforms  modulating  the  I/Q  rails.  Similar¬ 
ly,  the  quantities  C..  . ,C0  .  depend  upon  the  binary  (i.e.,±l)  data  sequence* 
{x^}  to  be  transmitted  in  a  manner  specific  to  the  modulation  strategy  in 
use.  The  quantity  k  in  (8)  assumes  the  values  0  or  1,  with  the  choice  k=l 
generally  corresponding  to  the  case  of  staggered  overlapping  baseband  wave¬ 
forms  as  in  MSK.  Finally,  the  quantities  t  and  0  in  (8)  are,  in  general,  ran¬ 
dom  timing  epoch  and  phase  offset,  respectively.  An  important  part  of  ongoing 
research  efforts  will  be  the  development  of  appropriate  schemes  for  acquiring 
and  tracking  symbol  synchronization  and  carrier  phase  in  the  presence  of 
spatially  distributed  interference.  A  summary  of  present  modulation  capabil¬ 
ities  in  the  ICS  and  their  representation  in  the  form  (8)  is  provided  in 
Table  1. 

Finally,  the  class  of  interferring  signals  ^(t),  i=l,2,...,J  of  interest 
will  typically  be  burst  or  impulsive  noise  processes  at  specified  geometrical 
locations  relative  to  the  origin  of  spatial  coordinates,  although  we  will  also 
consider  the  case  of  continuous-wave  (CW)  interference.  In  the  former  case 


+  The  sequence  {x^}  actually  represents  the  output  of  the  channel  encoder. 
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we  model  y^lt),  as  e  complex  low-density  shot  noise  process  as 

described  in  detail  in  Part  IV.  Specifically,  the  parameters  describing 
the  burst  or  impulsive  noise  model  are  specified  for  each  of  the  J  interferers. 
An  important  parameter  in  each  case  will  be  the  quantity 

E(|y.(t)|2} 

Y«  -  - S -  ;  J»1,2,...,J  ,  '9) 

J  "o 

or  equivalently  y2T  ,  which  represents  the  power  in  the  j'th  interferer 
j  s 

normalized  to  that  of  the  AWGN  component  when  measured  in  a  double-sided 
noise  bandwidth  equal  to  the  baud  rate  f  =1/T  .  At  any  rate,  the  signal 
power  relative  to  the  power  from  the  j'th  interferer  is  then 


E  /T 
s  s 


E  A 
s'*  u 


J  E( |?j(t) j2}  y2Ts 


j:  r, 

j  •  *  • 


do) 


so  that  knowing  E  /N_  and  the  dimensionaless  parameter  y2T  allows  explicit 
S  U  J  s 

evaluation  of  the  signal-to-interference  ratio  (SIR).  In  coded  systems  it 
is  the  quantity  E^/Nq  which  serves  as  the  independent  variable  with  E^RE^. 
Here  E^  is  the  energy  per  information  bit  and  R  is  the  code  rate  in  in¬ 
formation  bits  transmitted  per  channel  use.  It  follows  that  the  SIR  for  the 
J'th  interferer  is  then 


is. . s(  vv 

p  yfT  ’ 

J  J  s 


A  number  of  pulse  shapes  are  presently  available  in  the  ICS  for  simulat¬ 
ing  burst  or  impulsive  noise.  This  will  be  modified  to  include  Gaussian  bursts 
as  well  as  pulsed  CW.  It  is  also  intended  to  provide  a  continuous  Gaussian 
interference  capability  as  well  as  CW  at  a  specified  off  set  from  the  carrier. 
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In  the  latter  case  the  interference  signal  has  complex  envelope  of  the  form 


tsM  -  ^ 


J 2*f,t 


(12) 


where  is  the  corresponding  signal  power,  and  fj  is  the  frequency  offset 

in  Hz.  Actually  the  normalixed  quantity  f.T  ,  which  is  the  offset  normal- 

J  s 

ized  to  the  baud  rate,  will  be  specified.  The  interference  power  will  now 


be  adjusted  for  a  specified  value  of  where  now 

J  s 


Y 


2  .  _i  . 


i=l,2, . .  .-,J 


(13) 


The  SIR  is  then  given  by  (10)  for  uncoded  systems  and  by  (11)  for  coded 
systems. 

III.  Adaptive  Array  Simulation: 

The  signal  and  interference  models  have  been  described  in  the  preceding 
section.  In  the  present  section  we  describe  the  adaptive  array  processing 
structures  to  be  studied  both  analytically  and  through  simulation  as  part  of 
ongoing  research  efforts.  We  assume  that  the  appropriate  array  element  out¬ 
puts  are  generated  as  illustrated  in  Fig.  1.  Here  the  modulation  type,  E  /N 

s  u 

and  transmitter  filtering  parameters  are  chosen  exactly  as  in  the  present  ICS. 
The  choice  of  channel  models  now  includes;  AWGN,  impulsive,  and  fading/ 
dispersive  channel.  To  this  we  will  add  a  fourth  option  to  be  called 
spatially  distributed  interference  channel.  As  indicated  in  Fig.  1  the 
user  must  then  supply  the  array  parameters,  the  signal  location  and  the 
interference  parameters.  We  describe  each  of  these  quantities  separately. 

A May  PaSia/neXeAA  :  Here  we  must  specify  the  number  N  of  elements  as  well  as 
their  location  with  respect  to  a  phase  center  or  origin  of  spatial  coordinates. 
We  intend  to  restrict  attention  to  linear  arrays  consisting  of  no  more  than 

i 


N=8  elements.  The  location  of  each  element  vita  respect  to  the  origin 

vill  be  specified  in  terms  of  the  distance  d1  in  units  of  the  wavelength 

X  at  the  assumed  carrier  frequency.  The  elements  vill  all  be  assumed 
c 

omnidirectional 

SZgnaZ  Location:  We  assume  that  the  signal  location  is  specified  as  the 

angle  0  with  respect  to  broadside  as  illustrated  in  Fig.  2.  The  appro- 
s 

priate  element  delays  are  then  easily  calculated  as 

d! 

Ti0  =  sin6g  ;  i=l,2,...,Ii  ,  (lk) 

c 

where  the  proper  algebraic  signs  must  be  chosen  depending  upon  which  side 

of  the  phase  center  the  element  lies.  The  phase  factors  ^'ir*'cTio  “n  ^ 

can  then  be  computed  as  6.  ®±2ird'sin5  ,i«l,2,. . .  ,H  in  a  convenient  form 

XU  X  s 

independent  of  the  underlying  carrier  frequency. 


Iw^eAjjeAewce.  VivwmttVU,:  Ve  intend  to  limit  consideration  to  no  mere  than 
J=lt  interference  signals.  For  each  interferer  then  we  must  first  specify 


the  direction  6  with  respect  to  broadside  as  illustrated  in  Fig.  2.  This  then 


allows  evaluation  of  the  phase  factors  S^-2irfcT^  in  (7)  to  be  computed  as 


eij=±2ird^sin6jJi=l,2,. ..  ,H,  j=l,2,...,J.  Then  the  interference  type  must  be 


chosen.  Here  the  following  options  will  be  provided: 


1.  CW 

2.  Continuous  Gaussian 

3.  Pulsed  CW 

It.  Gaussian  Burst 
5.  Impulsive  Noise 


For  example,  in  the  case  of  impulsive  noise,  as  described  in  Part  IV  of  this 
report,  we  must  specify  the  following  quantitites: 


a.  Pulse  Arrivals:  The  parameter  v  specifying 
the  randomness  of  pulse  arrivals^and  the 

quantity  XT  specifying  pulse  arrival  rate. 

J  S 

b.  Pulse  Shape:  The  quantity  hQ  (t)  specifying  pulse 
shape  as  well  as  the  quantity^ajT.,  specifying 
pulse  duration. 

c.  Pulse  Amplitudes:  Here  we  must  specify  the  dis¬ 
tribution  of  pulse  amplitudes  and  any  scale  para¬ 
meters  required. 

d.  Pulse  Intensity:  Here  we  must  specify  the  quantity 

y^T  representing  the  interference  power  to 
background  noise  power. 

In  each  of  the  other  choices  of  interference  component  similar  parameters 
must  be  specified  as  discussed  previously  in  Part  IV. 

Having  discussed  how  the  element  outputs  are  to  be  generated,  we  now 
discuss  how  these  outputs  are  to  be  processed  in  an  adaptive  fashion.  The 
generic  receiver  structures  to  be  considered  in  this  study  are  illustrated 

in  Fig.  3.  Here  the  element  outputs  ?  (t),i=l,2 . N  are  applied  as  input 

to  a  weighting  network  which  adaptively  combines  these  inputs  to  produce  the 
complex  signal  ?(t) .  The  weighting  is  performed  under  control  of  an  appro¬ 
priately  defined  adaptive  algorithm.  We  intend  to  incorporate  a  variety  of 
adaptive  algorithms  into  the  ICS.  This  will  include: 

1.  The  least  mean-square  (LMS)  algorithm 

2.  The  generalized  signal-to-noise  ratio 
(GSNR)  algorithm. 

3.  The  maximum  likelihood  (ML)  algorithm. 

In  the  case  of  the  LMS  algorithm,  for  example  a  reference  input  d(t)  is 
required  to  develop  an  error  signal  S(t)=f (t)-d(t) .  We  propose 
obtaining  this  in  either  of  two  ways.  In  the  first,  called 
the  patched  reference  mode,  we  assume  a  "cheater"  line  is  available 
from  the  transmitter  so  that  the  reference  can  be  taken  as  the  actual 


transmitted  signal,  i.e.,  d(t)=s(t).  In  the  second  mode,  called  the 
decision-directed  mode,  the  desired  signal  is  obtained  through  a  decision 
feedback  scheme.  Here  the  array  output  r(t)  is  processed  in  a  temporal 
receiver  which  includes  possible  predetection  processing,  matched  filtering 
and  possibly  error-control  decoding.  The  output  decisions,  labeled  sL 
in  Fig.  3,  are  accepted  as  correct  and  passed  through  a  channel  model 
which  is  a  local  replica  of  the  channel  encoder  and  modulator/transmitter 
cascade  including  any  a  priori  known  signal  filtering  and/or  distortion 

A 

mechanisms.  The  output  of  this  model  s(t)  is  then  an  estimate  of  the  actual 
transmitted  signal  and  in  this  mode  we  choose  d(t)=s(t).  This  same  type  of 
decision-directed  processing  can  be  applied  to  other  adaptive  algorithms  as 
well  and  will  be  thoroughly  investigated  in  the  course  of  ongoing  research 
efforts. 

The  weighting  network  in  Fig.  3  will  be  assumed  of  the  form  illustrated 
in  Fig.  U.  Here  the  element  outputs  are  first  processed  in  a  spatial  corr¬ 
ection  filter  which  consists  of  a  bank  of  phase  shifters.  The  phase  shift 
introduced  on  the  i'th  branch  is  the  negative  of  0io=±2Trd^sin0s ,  i=l,2,...,N 

i 

which  was  the  phase  shift  experienced  by  the  signal  component  at  angle  0 


as  intercepted  by  the  i'th  element.  The  spatial  correction  filter  then 


filtering  function  H(ju;£)  is  known  for  a  fixed  value  of  c.  It  is  the 
parameter  vectors  2^,  i=l,2,...,K  which  are  controlled  by  the  adaptive 
algorithm.  We  will  be  particularly  interested  in  the  case  where  the  filters 
take  the  form  of  a  tapped  delay  line  or  transversal  filter  as  illustrated  in 
Fig.  5-  Here  the  parameters  2^,  i=l,2,...,N  represent  the  tap  weights  and 
as  illustrated  in  Fig.  5  the  parameter  vector  d . .  ,2L)  is  of  dimen¬ 
sion  L  while  L-l  delay  elements  of  T  seconds  each  is  required.  The  result¬ 
ing  transfer  function  is  of  the  form 

H(ju;5)  =  51+d2e'juT+23e‘,32L:T+....aLe'’3(L"l)“T 


£=1 


l 


(15) 


In  our  work  we  will  restrict  attention  to  a  number  of  weights  no  acre  than 
L=7.  Define  the  weight  vector  w  then  as  a  concatenation  of  the  2^  vectors 
such  that 

r-  2.1  t 
-2 


(16) 


L  Iw  J 


which  is  then  of  dimension  KL.  Under  the  limits  imposed  previously,  this 
dimension  can  be  no  greater  than  56,  although  use  of  much  smaller  dimensions 
will  generally  be  the  case.  Similarly,  if  we  define  the  complex  L- vectors 
as  x^(t  )*(x^(t)  ,x^(t-T) ,. . .  ,x^(t-tL-l)T))  and  then  the  NL-vector  X(t)  as 


the  array  output  ?'(t)  can  then  be  written  as 

?'(t)=  tfTx(t)  .  (18) 

The  linear  weighting  of  elements  of  the  vector  x(t)  is  then  in  a  form  such  that 
standard  adaption  algorithms  can  be  applied.  Finally,  we  note  from  Fig.  5 
that  the  actual  array  output  ?(t)  is  a  linearly  filtered  version  of  f'(t). 

This  post-array  filtering  could  clearly  have  been  incorporated  into  the  pre¬ 
detection  filtering  section  of  the  temporal  receiver  in  Fig.  3.  We  have 
chosen  to  place  it  here  to  provide  some  possible  noise  filtering  before 
applying  this  to  the  adaptive  algorithm  as  indicated  in  Fig.  3.  This 
filtering  earn  be  implemented  using  existing  filtering  capabilities  of  the  ICS. 

It  should  be  noted  at  this  time  that  there  is  no  unequivocal  guidance 
in  choosing  the  tap  delays  in  the  tapped-delay  line  filter.  Specifically,  if 
Bq  is  the  double-sided  equivalent  rectangular  bandwidth  of  the  signal  S(t) 
in  Hz  we  would  expect  that  the  sampling  interval  T  should  satisfy  T<l/2BQ 
and  since  BqTs>1  this  would  imply  that  T/Tg<l/2,  or  a  delay  no  greater  than 
half  a  baud  period.  In  the  existing  ICS  structure  it  is  convenient  to  take 
T/Ts=l/2  for  some  k>l  to  be  specified  by  the  user.  This  provision  will  then 
be  provided  as  a  user  option  otherwise  the  default  condition  k=l  will  be 
assumed. 

In  the  preceding  we  have  basically  described  how  the  DESIGN  Mode  of  the 
ICS  is  to  be  modified  to  allow  incorporation  of  an  adaptive  array  processing 
capability.  We  now  describe  how  the  other  operating  modes  are  to  be  modified. 
We  describe  the  VALIDATION  AND  SIMULATION  Modes  separately. 

VALIDATION  MOVE:  At  present  we  have  three  classes  of  waveform  plotting  options 
baseband  waveforms,  channel  waveforms,  and  synchronization  waveforms.  Within 
these  existing  options  we  will  add  the  option  of  plotting  the  prefiltered 


array  output  h'(t)  as  veil  as  the  actual  array  output  r(t)  under  the 
channel  -waveform  category.  Actually,  the  ability  to  plot  r(t)  is  already 
available  and  presently  labeled  "channel  output"  sc  this  presents  no  diffi¬ 
culties.  Similarly,  under  the  synchronization  waveform  category  in  is  de¬ 
sirable  to  add  the  provision  for  plotting  an  estimate  of  the  instantaneous 
SIR  as  a  function  of  time  and/or  snapshots  of  the  instantaneous  array  spatial 
pattern.  This  information  would  provide  some  understanding  of  the  "spatial 
synchronization"  properties  of  the  array  together  with  the  more  conventional 
time  and/or  phase  synchronization.  Since  we  propose  updating  the  weights 
no  faster  than  at  the  baud  rate"^  it  is  not  clear  that  enough  data  can  be 
collected  in  the  VALIDATION  Mode  to  demonstrate  any  significant  convergence 
characteristics.  This  information  can  best  be  collected  in  the  much  faster 
SIMULATION  Mode  and  is  described  in  somewhat  more  detail  below.  Nevertheless 
we  will  attempt  to  definitively  establish  the  feasibility  of  collecting  and 
displaying  this  information  in  the  VALIDATION  Mode  since  we  feel  this  would 
be  useful  for  demonstration  purposes. 

SIMULATION  MOVE:  Here  we  will  be  primarily  interested  in  collecting  data  on 
symbol  error  probability  Pg  and  bit  error  probability  in  terms  of  the  para 
meters  of  the  spatially  distributed  interference  channel  as  well  as  those 
of  the  adaptive  array.  We  will  also  be  interested  in  more  detailed  error 
statistics  such  as  the  error-free  gap  distribution,  the  C-allager  burst 
statistic,  etc.  Fortunately  this  capability  already  exists  in  the  ICS  so 
this  presents  no  difficulty. 


It  should  "be  noted  that  the  error  statistics  described  in  the  preceding 


are  obtained  by  averaging  over  all  modes  of  the  interference  and  are  not 
exclusively  concerned  vith  the  transient  behavior  of  the  adaptive  array 
as  is  most  work  in  this  area.  For  impulsive  or  burst  interference  character 
istics  the  latter  approach  simply  does  not  make  sense  since  the  adaptive 
array  is  generally  in  a  transient  state,  depending,  of  course,  on  the  number 
of  interferers  and  their  respective  pulse  arrival  rate.  For  purely  CW  or 
continuous  Gaussian  interference,  on  the  other  hand,  the  transient  behavior 
of  the  adaptive  array  night  well  be  a  matter  of  legitimate  concern,  although 
cuite  pheripheral  to  the  operating  error  characteristics  of  the  digital 
communications  link.  In  this  case  we  propose  allowing  rhe  array  tc  adapt 
first  during  an  initial  acquisition  or  training  period  of  specified  duration 
during  which  no  error  statistics  are  collected.  The  weights  can  then  either 
be  frozen  at  the  end  of  the  acquisition  period  or  allowed  to  continue  adapt¬ 
ing  intc  the  tracking  period  during  •which  error  statistics  ere  collected. 

An  acquisition  period  will  also  be  provided  for  impulsive  or  burst  inter¬ 
ference  although  in  this  case  it  makes  no  sense  to  freeze  the  weights  at 
the  end  of  this  period.  The  user  will  be  allowed  the  option  of  either 
the  patched  reference  or  decision-directed  adaptive  adjustment  node  during 
either  the  acquisition  or  tracking  modes.  This  is  entirely  consistent  with 
the  present  structure  and  operating  usage  of  the  ICS  and  should  present 
little  problem  in  implementation. 

Finally,  as  mentioned  previously,  we  would  like  to  have  the  ability  to 
plot  the  instantaneous  SIR  as  a  function  of  time  and/or  plot  snapshots  of 
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subsequent  plotting  in  the  ANALYSIS  Mode,  of  the  SIR  at  multiples  of  the 
array  update  rate.  This  can  be  calculated  from  knowledge  of  the  instant¬ 
aneous  array  weight  vector  and  the  signal/interference/sensor  scenario, 
although  this  is  yet  to  be  worked  out.  Similarly,  snapshots  of  the  array 
pattern  can  be  calculated  from  the  instantaneous  weight  vector.  These 
will  likewise  be  available  for  display  in  the  ANALYSIS  Mode  at  specified 
multiples  of  the  array  update  weight.  A  specific  scheme  for  accomplishing 
this  will  be  developed  and  reported  upon  at  some  later  time. 

IV .  Research  Directions: 

The  incorporation  of  an  adaptive  array  processing  capability  into  the 
ICS  is  not  to  be  considered  an  end  in  itself  but  rather  to  provide  a  com¬ 
prehensive  and  flexible  simulation  tool  as  an  adjunct  to  parallel  analytical 
efforts.  In  this  section  we  describe  some  initial  research  directions  which 
are  expected  to  make  extensive  use  of  the  ICS  in  this  capacity.  These  initial 
research  directions  will  include  the  following: 

1.  Uncoded  System  Performance:  We  will  be  specifically 
interested  in  the  operating  error  probability  per¬ 
formance  of  uncoded  digital  communication  systems 
for  various  signal/interference/sensor  scenarios. 

Of  particular  interest  will  be  impulsive  or  burst 
interference  channels .  While  the  SIR  and  total  SNR 
are  easily  calculated  these  quantities  do  not  deter¬ 
mine  error  rates  in  non-Gaussian  interference  environ¬ 
ments.  Explicit  calculation  of  error  rates  for  special 
cases  may  be  possible  but  exceedingly  tedious  and 
simulation  results  will  prove  quite  useful.  We  expect 
that  nonlinear  predetection  processing  in  the  temporal 
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receiver  will  be  required  to  mitigate  the  effects 
of  non-Gaussian  interference  bursts.  Experience 
has  indicated  that  broadband  spatial  nulls  vill  be 
required  if  spazial  filtering  is  to  be  effective  in 
this  environment.  We  vill  be  particularly  interest¬ 
ed  in  uncovering  any  particular  sensitivities  of 
specific  modulation  strategies  to  spatially  distri¬ 
buted  interference. 

Coded  System  Performance:  We  expect  that  the  most 
efficacious  solution  to  minimizing  the  effect  of 
spatially  distributed  interference  vill  consist  of 
a  judicious  combination  of  adaptive  spatial  filter¬ 
ing  and  either  error-control  and/or  spread-spectrum 
coding.  In  impulsive  or  burst  interference  channels 
ve  expect  that  the  nev  lov-rate  convolutional  code  construct 
ions  described  in  Part  III  and  vill  prove  particularly 
effective.  Similarly,  for  block  codes  the  reduced 
complexity  soft-decision  decoding  algorithms  in 
Part  II  should  prove  useful.  These  features,  to¬ 
gether  vith  a  limited  spread-spectrum  capability,  are 
presently  being  incorporated  into  the  ICS  and  some 
preliminary  results  for  single-element  reception 
are  reported  in  Part  I.  It  is  expected  that  these 
results  vill  provide  useful  insight  in  the  adaptive 
multi-element  case.  At  any  rate,  some  combination 
of  predetection  and/or  postdetection  nonlinear  pro¬ 
cessing  is  expected  to  be  required  in  the  temporal 
receiver. 

Adaptive  Algorithms:  As  noted  previously,  various 
adaptive  algorithms  vill  be  investigated  for  their 
suitability  in  this  application.  Of  particular 
interest  vill  be  the  performance  in  impulsive  or 
burst  noise  environments.  For  example,  should  some 


type  of  performance  monitor  be  provided  no  sense 
the  presence  or  absence  of  interference  ana  allow 
array  adaption  only  when  believed  present?  How- 
should  a  performance  monitor  be  implemented?  Are 
specific  adaptive  algorithms  particularly  sensitive 
to  spatially  distributed  interference  of  the  impul¬ 
sive  or  burst  variety?  These  and  other  questions 
are  to  be  thoroughly  explored. 

h.  Behavior  of  Ancillary  Systems :  A  number  of  ancill¬ 
ary  systems  such  as  bit  synchronizers  and  phase 
trackers  are  required  as  part  of  a  functioning  dig¬ 
ital  communications  receiver.  Our  interest  will 
be  in  investigating  the  behavior  of  these  ancillary 
systems  in  the  presence  of  spatially  distributed 
interference  and  adaptive  array  processing.  Alter¬ 
native  design  strategies  which  provide  more  robust 
performance  in  this  environment  will  be  developed 
and  investigated  through  simulation.  The  results 
should  prove  useful  in  the  development  of  integrat¬ 
ed  spatial/temporal  processor  for  digital  communi¬ 
cations  in  spatially  distributed  interference 
channels. 

V.  Summary  and  Conclusions: 

We  have  provided  an  outline  of  how  an  adaptive  array  processing  cap¬ 
ability  can  be  incorporated  into  the  existing  ICS  and  have  described  some 
initial  research  directions  that  will  make  extensive  use  of  this  capability. 
The  structure  described  here,  with  appropriate  modification  as  required, 
will  be  incorporated  into  the  ICS  as  part  of  ongoing  efforts.  Explicit 
documentation,  as  well  as  simulation  results  will  be  reported  at  frequent 


intervals. 
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Interference  Directions. 


Figure  3 

Adaptive  Array  Processing  Configuration 


Figure  4 

Structure  of  Element  Weighting  Network 
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